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Preface 


Research in physics, conducted at the end of the 19"" century and in the Some Preliminary Remarks 
first half of the 20" century, revealed exceptionally peculiar nature of the 

laws governing the behaviour of microparticles-atoms, electrons, and so 

on. On the basis of this research a new physical theory called quantum 


mechanics was founded. 


The growth of quantum mechanics turned out to be quite complicated 
and prolonged. The mathematical part of the theory, and the rules linking 
the theory with experiment, were constructed relatively quickly (by the 
beginning of the thirties). However, the understanding of the physical and 
philosophical substance of the mathematical symbols used in the theory 


was unresolved for decades. In Fock’s words, 


The mathematical apparatus of non-relativistic quantum mechanics 
worked well and was free of contradictions; but in spite of many 
successful applications to different problems of atomic physics the 
physical representation of the mathematical scheme still remained a 


roblem to be solved. ! 1 Fock, V. A. (1957). On the interpreta- 

P P 
tion of quantum mechanics. Czechoslovak 
Journal of Physics, 7:643-656 


Many difficulties are involved in a mathematical interpretation of the 
quantum-mechanical apparatus. These are associated with the dialectics 
of the new laws, the radical revision of the very nature of the questions 
which a physicist “is entitled to put to nature”, the reinterpretation of 
the role of the observer vis-d-vis his surroundings, the new approach to 
the question of the relation between chance and necessity in physical 
phenomena, and the rejection of many accepted notions and concepts. 
Quantum mechanics was born in an atmosphere of discussions and heated 
clashes between contradictory arguments. The names of many leading 
scientists are linked with its development, including N. Bohr, A. Einstein, 
M. Planck, E. Schrédinger, M. Born, W. Pauli, A. Sommerfeld, L. de Broglie, 
P. Ehrenfest, E. Fermi, W. Heisenberg, P. Dirac, R. Feynman, and others. 


It is also not surprising that even today anyone who starts studying 


quantum mechanics encounters some sort of psychological barrier. This 
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is not because of the mathematical complexity. The difficulty arises from 
the fact that it is difficult to break away from accepted concepts and 
to reorganize one’s pattern of thinking which are based on everyday 


experience. 


Before starting a study of quantum mechanics, it is worthwhile getting 
an idea about its place and role in physics. We shall consider (naturally in 


the most general terms) the following three questions: 


§ What is quantum mechanics? 
§ What is the relation between classical physics and quantum mechanics? 


§ What specialists need quantum mechanics? 
So, what is quantum mechanics? 


The question can be answered in different ways. First and foremost, 
quantum mechanics is a theory describing the properties of matter at the 
level of microphenomena - it considers the laws of motion of microparti- 
cles. Microparticles (molecules, atoms, elementary particles) are the main 


“characters” in the drama of quantum mechanics. 


From a broader point of view quantum mechanics should be treated as 
the theoretical foundation of the modern theory of the structure and prop- 
erties of matter. In comparison with classical physics, quantum mechanics 
considers the properties of matter on a deeper and more fundamental level. 
It provides answers to many questions which remained unsolved in classi- 
cal physics. For example, why is a diamond hard? Why does the electrical 
conductivity of a semiconductor increase with temperature? Why does a 
magnet lose its properties upon heating? Unable to get answers from clas- 
sical physics to these questions, we turn to quantum mechanics. Finally, 
it must be emphasized that quantum mechanics allows one to calculate 
many physical parameters of substances. Answering the question “What 


is quantum mechanics?”, Lamb remarked: 


The only easy one (answer) is that quantum mechanics is a disci- 
pline that provides a wonderful set of rules for calculating physical 


properties of matter.” 


What is the relation of quantum mechanics to classical physics? First 

of all quantum mechanics includes classical mechanics as a limiting (ex- 
treme) case. Upon a transition from microparticles to macroscopic bodies, 
quantum-mechanical laws are converted into the laws of classical me- 
chanics. Because of this it is often stated, though not very accurately, 
that quantum mechanics “works” in the microworld and the classical me- 
chanics, in the macroworld. This statement assumes the existence of an 


isolated “microworld” and an isolated “macroworld”. In actual practice 


2Lamb Jr, W. E. (1969). An operational 
interpretation of nonrelativistic quantum 
mechanics. Physics Today, 22:23-28 


we can only speak of microparticles (microphenomena) and macroscopic 
bodies (macrophenomena). It is also significant that microphenomena 
form the basis of macrophenomena and that macroscopic bodies are made 
up of microparticles. Consequently, the transition from classical physics 
to quantum mechanics is a transition not from one “world” to another, 
but from a shallower to a deeper level of studying matter. This means 
that in studying the behaviour of microparticles, quantum mechanics con- 
siders in fact the same macroparticles, but on a more fundamental level. 
Besides, it must be remembered that the boundary between micro- and 
macro- phenomena in general is quite conditional and flexible. Classical 
concepts are frequently found useful when considering microphenomena, 
while quantum-mechanical ideas help in the understanding of macrophe- 
nomena. There is even a special term “quantum macrophysics” which 

is applied, in particular, to quantum electronics, to the phenomena of 


superfluidity and superconductivity and to a number of other cases. 


In answering the question as to what specialists need quantum me- 
chanics, we mention beforehand that we have in mind specialists training 
in engineering colleges. There are at least three branches of engineering 
for which a study of quantum mechanics is absolutely essential. Firstly, 
there is the field of nuclear power and the application of radioactive iso- 
topes to industry. Secondly, the field of materials sciences (improvement 
of properties of materials, preparation of new materials with preassigned 
properties). Thirdly, the field of electronics and first of all the field of 
semiconductors and laser technology. If we consider that today almost 
any branch of industry uses new materials as well as electronics on a 
large scale, it will become clear that a comprehensive training in engineer- 


ing is impossible without a serious study of quantum mechanics. 


The aim of this book is to acquaint the reader with the concepts and 
ideas of quantum mechanics and the physical properties of matter; to 
reveal the logic of its new ideas, to show how these ideas are embodied 
in the mathematical apparatus of linear operators and to demonstrate the 
working of this apparatus using a number of examples and problems of 


interest to engineering students. 


The book consists of three chapters. By way of an introduction to 
quantum mechanics, the first chapter includes a study of the physics of 
microparticles. Special attention has been paid to the fundamental ideas 
of quantization and duality as well as to the uncertainty relations. The 
first chapter aims at “introducing” the main “character”, i.e. the micropar- 
ticle, and at showing the necessity of rejecting a number of concepts of 


classical physics. 


The second chapter deals with the physical concepts of quantum me- 


chanics. The chapter starts with an analysis of a set of basic experiments 


The Structure of the Book 
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which form a foundation for a system of quantum-mechanical ideas. This 
system is based on the concept of the amplitude of transition probability. 
The rules for working with amplitudes are demonstrated on the basis of 

a number of examples, the interference of amplitudes being the most im- 
portant. The principle of superposition and the measurement process are 
considered. This concludes the first stage in the discussion of the physical 
foundation of the theory. In the second stage an analysis is given based 

on amplitude concepts of the problems of causality in quantum mechanics. 
The Hamiltonian matrix is introduced while considering causality and its 
role is illustrated using examples involving microparticles with two basic 
states, with emphasis on the example of an electron in a magnetic field. 
The chapter concludes with a section of a general physical and philosophi- 


cal nature. 


The third chapter deals with the application of linear operators in 
the apparatus of quantum mechanics. At the beginning of the chapter 
the required mathematical concepts from the theory of Hermitian and 
unitary linear operators are introduced. It is then shown how the physical 
ideas can be “knitted” to the mathematical symbols, thus changing the 
apparatus of operator theory into the apparatus of quantum theory. The 
main features of this apparatus are further considered in a concrete form 
in the framework of the coordinate representation. The transition from 
the coordinate to the momentum representation is illustrated. Three ways 
of describing the evolution of microsystems in time, corresponding to the 
Schrédinger, Heisenberg and Dirac representation, have been discussed. A 
number of typical problems are considered to demonstrate the working of 
the apparatus; particular attention is paid to the problems of the motion of 
an electron in a periodic field and to the calculation of the probability of a 


quantum transition. 


The book contains a number of interludes. These are dialogues in 
which the author has allowed himself free and easy style of considering 
certain questions. The author was motivated to include interludes in the 
book by the view that one need not take too serious an attitude when 
studying serious subjects. And yet the reader should take the interludes 
fairly seriously. They are intended not so much for mental relaxation, 
as for helping the reader with fairly delicate questions, which can be 


understood best through a flexible dialogue treatment. 


Finally, the book contains many quotations. The author is sure that 
the “original words” of the founders of quantum mechanics will offer the 


reader useful additional information. 


The author wishes to express his deep gratitude to Prof. I.I. Gurevich, 
Corresponding Member of the USSR Academy of Sciences, for the stim- 


ulating discussions which formed the basis of this book. Prof. Gurevich 


Personal Remarks 


discussed the plan of the book and its preliminary drafts, and was kind 
enough to go through tho manuscript. His advice not only helped mould 
the structure of the book, but also helped in the nature of exposition of 
the material. The subsection “The Essence of Quantum Mechanics” in 


Section 16 is a direct consequence of Prof. Gurevich’s ideas. 


The author would like to record the deep impression left on him by 
the works on quantum mechanics by the leading American physicist R. 
Feynman.’ While reading the sections in this book dealing with the ap- 
plications of the idea of probability amplitude, superposition principle, 
microparticles with two basic states, the reader can easily detect a definite 
similarity in approach with the corresponding parts in Feynman’s “Lec- 
tures in Physics”. The author was also considerably influenced by N. Bohr 
(in particular by his wonderful essays Atomic Physics and Human Knowl- 
edge)’, V. A. Fock?, W. Pauli®, P. Dirac’, and also by the comprehensive 
works of L. D. Landau and E. M. Lifshitz®, D. I. Blokhintsev’, E. Fermi!®, L. 
Schiff". 


The author is especially indebted to Prof. M. I. Podgoretsky, D.Sc., for 
a thorough and extremely useful analysis of the manuscript. He is also 
grateful to Prof. Yu. A. Vdovin, Prof. E. E. Lovetsky, Prof. G. F. Drukarev, 
Prof. V. A. Dyakov, Prof. Yu. N. Pchelnikov, and Dr. A. M. Polyakov, all 
of whom took the trouble of going through the manuscript and made a 
number of valuable comments. Lastly, the author is indebted to his wife 
Aldina Tarasova for her constant interest in the writing of the book and 
her help in the preparation of the manuscript. But for her efforts, it would 


have been impossible to bring the book to its present form. 
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Prelude: Can the System of 
Classical Physics Concepts Be 


Considered Logically Perfect? 


He who would study organic existence, 
First drives out the soul with rigid persistence, 
Then the parts in his hands he may hold and class, 
But the spiritual link is lost, alas! 


Goethe (Faust) 


AUTHOR: It is well known that the basic contents of a physical the- 
ory are formed by a system of concepts which reflect the objective 
laws of nature within the framework of the given theory. Let us 
take the system of concepts lying at the root of classical physics. 


Can this system be considered logically perfect? 


CLASSICAL PHYSICIST: It is quite perfect. The concepts of classical 
physics were formed on the basis of prolonged human experience; 


they have stood the test of time. 
AUTHOR: What are the main concepts of classical physics? 


CLASSICAL PHYSICIST: I would indicate three main points: (a) con- 
tinuous variation of physical quantities; (b) the principle of classical 
determinism; (c) the analytical method of studying objects and phe- 
nomena. While talking about continuity, let us remember that the 
state of an object at every instant of time is completely determined 
by describing its coordinates and velocities, which are continuous 
functions of time. This is what forms the basis of the concept of 
motion of objects along trajectories. The change in the state of an 
object may in principle be made as small as possible by reducing the 


time of observation. 


Participants: the Author 

and the Classical Physicist 
(Physicist of the older gener- 
ation, whose views have been 
formed on the basis of classical 


physics alone). 
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Classical determinism assumes that if the state of an object as well as 
all the forces applied to it are known at some instant of time, we can 
precisely predict the state of the object at any subsequent instant. 
Thus, if we know the position and velocity of a freely falling stone 
at a certain instant, we can precisely tell its position and velocity 

at any other instant, for example, at the instant when it hits the 


ground. 


AUTHOR: In other words, classical physics assumes an unambiguous 
and inflexible link between present and future, in the same way as 


between past and present. 


CLASSICAL PHYSICIST: The possibility of such a link is in close 
agreement with the continuous nature of the change of physical 
quantities: for every instant of time we always have an answer to 
two questions: “What are the coordinates of an object?” and, “How 
fast do they change?” Finally, let us discuss the analytical method 
of studying objects and phenomena. Here we come to a very impor- 
tant point in the system of concepts of classical physics. The latter 
treats matter as made up of different parts which, although they 
interact with one another, may be investigated individually. This 
means that firstly, the object may be isolated from its environments 
and treated as an independent entity, and secondly, the object may 
be broken up, if necessary, into its constituents whose analysis could 


lead to an understanding of the nature of the object. 


AUTHOR: It means that classical physics reduces the question “what 


is an object like?” to “what is it made of?” 


CLASSICAL PHYSICIST: Yes, indeed. In order to understand any ap- 
paratus we must “dismantle” it, at least in one’s imagination, into its 
constituents. By the way, everyone tries to do this in his childhood. 
The same is applicable to phenomena: in order to understand the 
idea behind some phenomenon, we have to express it as a function 


of time, i.e. to find out what follows what. 


AUTHOR: But surely such a step will destroy the notion of the object 


or phenomenon as a single unit. 


CLASSICAL PHYSICIST: To some extent. However, the extent of 
this “destruction” can be evaluated each time by taking into account 
the interactions between different parts and relation between the 
time stages of a phenomenon. It may so happen that the initially 
isolated object (a part of it) may considerably change with time 
as a result of its interaction with the surroundings (or interaction 


between parts of the object). However, since these changes are 
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continuous, the individuality of the isolated object can always be 
returned over any period of time. It is worthwhile to stress here the 
internal logical connections among the three fundamental notions 


of classical physics. 


AUTHOR: I would like to add that one special consequence of the 
“principle of analysis” is the notion, characteristic of classical 
physics, of the mutual independence of the object of observation 
and the measuring instrument (or observer). We have an instrument 
and an object of measurement. They can and should be considered 


separately, independently from one another. 


CLASSICAL PHYSICIST: Not quite independently. The inclusion of 
an ammeter in an electric circuit naturally changes the magnitude 
of the current to be measured. However, this change can always be 


calculated if we know the resistance of the ammeter. 


AUTHOR: When speaking of the independence of the instrument and 
the object of measurement, I just meant that their interaction may 


be simply “ignored”. 
CLASSICAL PHYSICIST: In that case I fully agree with you. 


AUTHOR: Born has considered this point in Physics in My Genera- 


tion.!? Characterizing the philosophy of science which influenced 12 Born, M. (1956). Physics In My Gen- 
eration: A Selection of Papers. | Pergamon 


people of older generation”, he referred to the tendency to consider Pisce London 


that the object of investigation and the investigator are completely 
isolated from each other, that one can study physical phenomena 
without interfering with their passage. Born called such style of 
thinking “Newtonian”, since he felt that this was reflected in “New- 


ton’s celestial mechanics.” 


CLASSICAL PHYSICIST: Yes, these are the notions of classical 
physics in general terms. They are based on everyday commonplace 
experience and it may be confidently stated that they are acceptable 
to our common sense, i.e. are taken as quite natural. I rather believe 
that the “principle of analysis” is not only a natural but the only 
effective method of studying matter. It is incomprehensible how one 
can gain a deeper insight into any object or phenomenon without 
studying its components. As regards the principle of classical deter- 
minism, it reflects the causality of phenomena in nature and is in 


full accordance with the idea of physics as an exact science. 


AUTHOR: And yet there are grounds to doubt the “flawlessness” of 


classical concepts even from very general considerations. 


Let us try to extend the principle of classical determinism to the uni- 
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verse as a whole, We must conclude that the positions and velocities 
of all “atoms” in the universe at any instant are precisely determined 
by the positions and velocities of these “atoms” at the preceding in- 
stant. Thus everything that takes place in the world is predetermined 
beforehand, all the events can be fatalistically predicted. According 
to Laplace, we could imagine some “super-being” completely aware 
of the future and the past. In his Theorie analytique des probabilites, 


published in 1820, Laplace wrote!?: 13de Laplace, P. S. (1952). A Philosophi- 
cal Essay on Probabilities. Dover, NY 


An intelligence knowing at a given instant of time all forces acting 

in nature as well as the momentary positions of all things of which 
the universe consists, would be able to comprehend the motions of 
the largest bodies of the world and those of the lightest atoms in one 
single formula, provided his intellect were sufficiently powerful to 
subject all data to analysis, to him nothing would be uncertain, both 


past and future would be present to his eyes. 


It can be seen that an imaginary attempt to extend the principle of 
classical determinism to nature in its entity leads to the emergence 
of the idea of fatalism, which obviously cannot be accepted by 


common sense. 


Next, let us try to apply the “principle of analysis” to an investiga- 
tion of the structure of matter. We shall, in an imaginary way, break 
the object into smaller and smaller fractions, thus arriving finally 
at the molecules constituting the object further “breaking-up” leads 
us to the conclusion that molecules are made up of atoms. We then 
find out that atoms are made up of a nucleus and electrons. Accus- 
tomed to the tendency of splitting, we would like to know what an 
electron is made of. Even if we were able to get an answer to this 
question, we would have obviously asked next: What are the con- 
stituents, which form an electron, made of? And so on. We tend to 
accept the fact that such a “chain” of questions is endless. The same 
common sense will revolt against such a chain even though it is a 


direct consequence of classical thinking. 


Attempts were made at different times to solve the problem of this 
chain. We shall give two examples here. The first one is based on 
Plato’s views on the structure of matter. He assumed that matter is 


made up of four “elements” - earth, water, air and fire. 


Each of these elements is in turn made of atoms having definite 
geometrical forms. The atoms of earth are cubic, those of water 
are icosahedral; while the atoms of air and fire are octahedral and 
tetrahedral, respectively. Finally, each atom was reduced to trian- 


gles. To Plato, a triangle appeared as the simplest and most perfect 


mathematical form, hence it cannot be made up of any constituents. 
In this way, Plato reduced the chain to the purely mathematical 


concept of a triangle and terminated it at this point. 


The other example is characteristic for the beginning of the 20" 
century. It makes use of the external similarity of form between 
the planetary model of the atom and the solar system. It is assumed 
that our solar system is nothing but an isolated atom of some other, 
gigantic world, and an ordinary atom is a sort of “solar system” 

for some third dwarfish world for which “our electron” is like a 
planet. In this case we admit the existence of an infinite row of 
more and more dwarfish worlds, just like more and more gigantic 
worlds. In such a system the structure of matter is described in 
accordance with the primitive “chinese box” principle. The “chinese 
box” principle of hollow tubes, according to which nature has a 
more or less similar structure, was not accepted by all the physicists 
of older generations. However, this principle is quite characteristic 
of classical physics, it conforms to classical concepts, and follows 
directly from the classical principle of analysis. In this connection, 
criticizing Pascal’s views that the smallest and the largest objects 
have the same structure, Langevin’ pointed out that this would 
lead to the same aspects of reality being revealed at all levels. The 
universe should then be reflected in an absolutely identical fashion 
in all objects, though on a much smaller scale. Fortunately, reality 


turns out to be much more diverse and interesting. 


Thus, we are convinced that a successive application of the princi- 
ples of classical physics may, in some cases, lead to results which 
appear doubtful. This indicates the existence of situations for which 
classical principles are not applicable. Thus it is to be expected that 
for a sufficiently strong “breaking-up” of matter, the principle of 
analysis must become redundant (thus the idea of the independence 
of the object of measurement from the measuring instrument must 
also become obsolete). In this context the question “what is an elec- 


tron made of?” would simply appear to have lost its meaning. 


If this is so, we must accept the relativity of the classical concepts 
which are so convenient and dear to us, and replace them with some 
qualitatively new ideas on the motion of matter. The classical at- 
tempts to obtain an endless detailization of objects and phenomena 
mean that the desire inculcated in us over centuries “to study or- 
ganic existence” leads at a certain stage to a “driving out of the soul” 
and a situation arises, where, according to Goethe, “the spiritual link 


is lost”. 
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1 Physics of the Microparticles 





PHYSICS OF THE MICROPARTICLES 


1 Certain Characteristics and Properties of Microparticles 


Molecules, atoms, atomic nuclei and elementary particles belong to the Microparticles 
category of microparticles. The list of elementary particles is at present 
fairly extensive and includes quanta of electromagnetic field (photons) as 
well as two groups of particles, the hadrons and the leptons. Hadrons are 
characterized by a strong (nuclear) interaction, while leptons never take 
part in strong interactions. The electron, the muon and the two neutrinos 
(the electronic and muonic) are leptons. The group of hadrons is numer- 
ically much larger. It includes nucleons (proton and neutron), mesons 
(a group of particles lighter than the proton) and hyperons (a group of 
particles heavier than the neutron). With the exception of photons and 
some neutral mesons, all elementary particles have corresponding anti- 


particles. 


Among properties of microparticles, let us first mention the rest mass 
and electric charge. As an example, we note that the mass m of an electron 


is equal to 9.1 x 10° 


g; a proton has mass equal to 1836 m, a neutron, 
1839 m and a muon, 207 m. Pions (z-mesons) have a mass of about 270 m 
and kaons (K-mesons), about 970 m. The rest mass of a photon and of 


both neutrinos is assumed to be equal to zero. 


The mass of a molecule, atom or atomic nucleus is equal to the sum 
of the masses of the particles constituting the given microparticle, less a 
certain amount known as the mass defect. The mass defect is equal to the 
ratio of the energy that must be expended to break up the microparticle 
into its constituent particles (this energy is usually called the binding en- 
ergy) to the square of velocity of light. The stronger the binding between 
particles, the greater is the mass defect. Nucleons in atomic nuclei have 


the strongest binding-the mass defect for one nucleon exceeds 10 m. 


The magnitude of the electric charge of a microparticle is a multiple of 
the magnitude of the charge of an electron, which is equal to 1.6 x 107"? C 
(4.8 x 1071 CGSE units). Apart from charged microparticles, there also 
exist neutral microparticles (for example, photon, neutrino, neutron). The 
electric charge of a complex microparticle is equal to the algebraic sum of 


the charges of its constituent particles. 


Spin is one of the most important specific characteristics of a micropar- Spin of a Microparticle 
ticle. It may be interpreted as the angular momentum of the microparticle 
not related to its motion as a whole (it is frequently known as the inter- 
nal angular momentum of the microparticle). The square of this angular 
momentum is equal to h?s(s + 1), where s for the given microparticle is 
a definite integral or semi-integral number (it is this number which is 
usually referred to as the spin), ñ is a universal physical constant which 


plays an exceptionally important role in quantum mechanics. It is called 
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Planck’s constant and is equal to 1.05 x 10°" Js. Spin s of a photon is 


equal to 1, that of an electron (or any other lepton) is equal to F while 


pions and kaons don’t have any spin.! Spin is a specific property of a mi- 1The definition of spin of a micropar- 
ticle assumes that spin is independent 

of external conditions. This is true for 
the complex internal structure of the microparticle. True, it is sometimes elementary particles. However, the spin of 
an atom, for example, may change with a 
change in the state of the latter. In other 
ing around its axis (the very word “spin” means “rotate”). Such a mode is words, the spin of an atom may change as 
a result of influences on the atom which 
lead to a change in its state. 


croparticle. It does not have a classical analogue and certainly points to 
attempted to explain the concept of spin on the model of an object rotat- 


descriptive but not true. In any case, it cannot be literally accepted. The 
term “rotating microparticle” that one comes across in the literature does 
not by any means indicate the rotation of the microparticle, but merely 
the existence of a specific internal angular momentum in it. In order that 
this momentum be transformed into “classical” angular momentum (and 
the object thereby actually rotate) it is necessary to satisfy the conditions 


s > 1. Such a condition, however, is usually not satisfied. 


The peculiarity of the angular momentum of a microparticle is mani- 
fested, in particular, in the fact that its projection in any fixed direction 
assumes discrete values his, A(s — 1),...,—fis, thus in total 2s + 1 values. 

It means that the microparticle may exist in 2s + 1 spin states. Conse- 
quently, the existence of spin in a microparticle leads to the appearance of 


additional (internal) degrees of freedom. 


If we know the spin of a microparticle, we can predict its behaviour Bosons and Fermions 
in the collective of microparticles similar to it (in other words, to predict 
the statistical properties of the microparticle). It turns out that all the 
microparticles in nature can be divided into two groups, according to their 
statistical properties: a group with integral values of spin or with zero 


spin, and another with half-integral spin. 


Microparticles of the first group are capable of populating one and 
the same state in unlimited numbers.” Moreover, the more populated is 2The concept of the state of a micropar- 
a given state, the higher is the probability that a microparticle appears ee eee PERUON 3 DEIS 
in this state. Such microparticles are known to obey the Bose-Einstein 
statistics, in short they are simply called bosons. Microparticles of the 
second group may inhabit the states only one at a time, if the state under 
consideration is already occupied, no other microparticle of the given 
type can be accommodated there. Such microparticles obey Fermi-Dirac 


statistics and are called fermions. 


Among elementary particles, photons and mesons are bosons while 
the leptons (in particular, electrons), nucleons and hyperons are fermions. 
The fact that electrons are fermions is reflected in the well-known Pauli 


exclusion principle. 


All elementary particles except the photon, the electron, the proton Instability of Microparticles 


and both neutrinos are unstable. This means that they decay sponta- 
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neously, without any external influence, and are transformed into other 
particles. For example, a neutron spontaneously decays into a proton, an 


electron and an electronic antineutrino: 
n> pte +Ve 


It is impossible to predict precisely at what time a particular neutron 

will decay since each individual act of disintegration occurs randomly. 
However, by following a large number of acts, we find a regularity in 
decay. Suppose there are No neutrons (No > 1) at time t = 0. Then at the 


moment t we are left with 


N(t) = No exp (--) 


neutrons, where T is a certain constant characteristic of neutrons. It is 
called the lifetime of a neutron and is equal to 1 x 10° s. The quantity 
exp (—‘/r) determines the probability that a given neutron will not decay 


in time t. 


Every unstable elementary particle is characterized by its lifetime. 
The smaller the lifetime of a particle, the greater the probability that it 
will decay. For example, the lifetime of a muon is 2.2 x 10° s, that of a 
positively charged 2-meson is 2.6 X 107° s, while for a neutral -meson 
the lifetime is 1 x 107’ s and for hyperons, 1 x 107!" s. In recent years, 
a large number of particles (about 100) have been observed to have an 


0723 


anomalously small lifetime of about 1 x 107°? s to 1 x 1 s. These are 


called resonances. 


It is worth noting that hyperons and mesons may decay in different 
ways. For example, the positively charged 2-meson may decay into a 


muon and a muonic neutrino, 
ma — H + Vu, 
into a positron (anti-electron) and electronic neutrino, 
m* > et + ve, 
into a neutral z-meson, positron and electronic neutrino, 
mt — wo +e + Vo. 


For any particular z-meson, it is impossible to predict not only the time 
of its decay, but also the mode of decay it might “choose”. Instability is 
inherent not only in elementary particles, but also in other microparticles. 
The phenomenon of radioactivity (spontaneous conversion of isotopes of 
one chemical element into isotopes of another, accompanied by emission 


of particles) shows that the atomic nuclei can also be unstable. Atoms and 
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molecules in excited states are also unstable; they spontaneously return to 


their ground state or to a less excited state. 


Instability determined by the probability laws is, apart from spin, the 
second special specific property inherent in microparticles. This may also 
be considered as an indication of a certain “internal complexity” in the 


microparticles. 


In conclusion, we may note that instability is a specific, but by no 
means essential, property of microparticles. Apart from the unstable ones, 
there are many stable microparticles: the photon, the electron, the proton, 
the neutrino, the stable atomic nuclei, as well as atoms and molecules in 


their ground states. 


Looking at the decay scheme of a neutron, Interconversion of Microparti- 


cles 
na pte +Ve, 


an inexperienced reader might presume that a neutron is made up of 
mutually bound proton, electron and electronic antineutrino. Such an 
assumption is wrong. The decay of elementary particles is by no means 

a disintegration in the literal sense of the word; it is just an act of con- 
version of the original particle into a certain aggregate of new particles; 
the original particle is annihilated while new particles are created. The 
unfoundedness of the literal interpretation of the term “decay of particles” 
becomes apparent when one considers that many particles can decay in 


several different ways. 


The interconversion of elementary particles turns out to be much more 
diverse and complicated if we consider particles not only in a free, but 
also in a bound state. A free proton is stable, and a free neutron decays 
according to the equation mentioned above. If, however, the neutron 
and the proton are not free but bound in an atomic nucleus, the situation 
radically changes. Now the following equations of interconversion are 


operative: 


no ptr, 


p>n+r* 


(here, 2” is a negatively charged z-meson, the antiparticle of the 7*- 
meson). These equations very well illustrate that an attempt to find out 
whether the proton is a “constituent” of the neutron, or vice versa is 


pointless. 


Everyday experience teaches us that to break up an object into parts 
means to reveal its structure. The idea of analysis (or splitting) reflects 


the characteristic feature of classical methods. When we go over to the 
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microparticles, this idea still holds to a certain extent: the molecules are 
made up of atoms, the atoms consist of nuclei and electrons, the nuclei 
are made up of protons and neutrons. However, the idea exhausts itself 
at this point: for example, “splitting up” of a neutron or a proton does not 
reveal the structure of these particles. As regards elementary particles, 
when we say that a particle decays into parts, it does not mean that these 
particles constitute the given particle. This condition itself might serve as 


a definition of an elementary particle. 


Decay of elementary particles is not the only kind of interconversion 
of particles. Equally important is the case of interconversion of particles 
when they collide with one another. As an example, we shall consider 
some equations of interconversion during collision of photons (y) with 


protons and neutrons: 


ytprontn, 
ytn—pdt7, 
ytp ptm, 
ytnontn, 
ytproptwta7, 
ytnont+n +n, 
yt+p—>ptptp 


(where p-denotes an antiproton) 


It should be mentioned here that in all the above equations, the sum 
of the rest masses of the end particles is greater than the rest mass of 
the initial ones. In other words, the energy of the colliding particles is 
converted into mass (according to the well-known relation E = mc’). 
These equations demonstrate, in particular, the fruitlessness of efforts to 
break up elementary particles (in this case, nucleons) by “bombarding” 
them with other particles (in this case, photons): in fact, it does not lead to 
a breaking-up of the particles being bombarded at, but to the creation of 
new particles, to some extent at the expense of the energy of the colliding 


particles. 


A study of the interconversion of elementary particles permits us to 
determine certain regularities. These regularities are expressed in the 
form of laws of conservation of certain quantities which play the role 
of some definite characteristics of certain particles. As a simple exam- 
ple, we take the electric charge of a particle. For any interconversion of 
particles, the algebraic sum of electric charges of the initial and end par- 
ticles remains the same. The law of conservation of the electric charge 
refers to a definite regularity in the interconversion of particles: it permits 


one to summarily reject equations where the total electric charge is not 


27 


28 BASIC CONCEPTS OF QUANTUM MECHANICS 


conserved. 


As a more complicated example, we mention the so-called barionic charge of a particle. It has been 


observed that the number of nucleons during an interconversion of particles is conserved. With the 


discovery of antinucleons, it was observed that additional nucleons may be created, but they must 


he created in pairs with these antinucleons. So a new characteristic of particles, the barionic charge, 


was introduced. It is equal to zero for photons, leptons and mesons, +1 for nucleons, and -1 for antin- 


ucleons. This permits us to consider the above-mentioned regularity as a law of conservation of the 


total barionic charge of the particles. The law was also confirmed by the discoveries that followed: the 


hyperons were assigned a barionic charge equal to 1 (as for nucleons) and the antihyperons were given 


a barionic charge equal to -1 (as for antinucleons). 


While going over from macroparticles to microparticles, one would 
expect qualitatively different answers to questions like: Which dynamic 
variables should be used to describe the state of the object? How should 
its motion be depicted? Answers to these questions reveal to a consider- 


able extent the specific nature of microparticles. 


In classical physics, we make use of the laws of conservation of energy, 
momentum and angular momentum. It is well known that these laws are 
consequences of certain properties of the symmetry of space and time. 
Thus, the law of conservation of energy is a consequence of homogeneity 
of time (independence of the course of a physical process of the moment 
chosen as the starting point of the process); the law of conservation of 
momentum is a consequence of the uniformity of space (all points in space 
are physically equivalent); the law of conservation of angular momentum 
is a consequence of the isotropy of space (all directions in space are phys- 
ically equivalent). To elucidate the properties of symmetry of space and 
time, we note, for example, that thanks to these properties, Kepler’s laws 
describing the motion of the planets around the sun are independent of 
the position of the sun in the galaxy, of the orientation in space of the 
plane of motion of the planets and also of the century in which these laws 
were discovered. The connection between the properties of symmetry of 
space and time and the corresponding conservation laws means that the 
energy, momentum and angular momentum can be considered as integrals 
of motion, whose conservation is a consequence of the corresponding 


homogeneity of time and the homogeneity and isotropy of space. 


The absence of any experimental evidence indicating violation of the 
above-mentioned properties of symmetry of space and time for microphe- 
nomena reveals that such dynamic parameters as energy, momentum 
and angular momentum should retain their meaning when applied to mi- 
croparticles. In other words, the connection of these dynamic variables 
with the fundamental properties of symmetry in space and time makes 


them universal variables, i.e. variables which are “used” while considering 


Universal Dynamic Variables 
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the kind of phenomena occurring in different branches of physics. 


When transferring the concepts of energy, momentum and angular 
momentum from classical physics to quantum mechanics, however, the 
specific nature of the microparticles must be taken into account. In this 
connection we recall the well-known expressions for energy (E), momen- 
tum (p) and angular momentum (M) of a classical object, having mass m, 


coordinate F, velocity 7: 


=m (1.1) 


Eliminating the velocity, we get from here the relations connecting en- 


ergy, momentum and angular momentum of a classical object: 


2 


E= +U) (1.2) 
2m 
M = (#x®) (1.3) 


If we turn to a microparticle, we can go a bit farther, (see Section 3) and 
conclude that relations (1.2) and (1.3) are no longer valid. In other words, 
the classical connections between the integrals of motion become useless 
as we go over to microparticles (as regards relations (1.1), they cannot be 
mentioned at all since the very concept of the velocity of a microparticle, 
as we shall see below, is meaningless). This is the first qualitatively new 


circumstance. 


In order to consider the other qualitatively new circumstances, we 
must turn to the fundamental ideas of quantum mechanics, i.e. the idea of 


quantization of physical quantities and the idea of wave-particle duality. 


2 Two Fundamental Ideas of Quantum Mechanics 
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The essence of the idea of quantization lies in the fact that certain The Idea of Quantization (Dis- 
physical quantities related to the microparticles may assume, under rele- creteness) 
vant circumstances, only certain discrete values. These quantities are said 
to be quantized. £ 

Thus the energy of any microparticle in a bound state, like that of an fs 


electron in an atom, is quantized. The energy of a freely moving micropar- 





ticle, however is not quantized. 
4; é 


Let us consider the energy of an electron in an atom. The system of 


so-called energy levels corresponds to a discrete set of values of electron 


; E Figure 2.1: Energy levels of an electron 
energy. We consider two energy levels E; and Ez as shown in Figure 2.1 : E 
and quantum transition. 


(the values of electron energy are plotted along the vertical axis). The elec- 
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tron may possess energy E; or Ez and cannot possess any intermediate 
energy — all values of energy E satisfying the condition E} < E < Ez are 
forbidden for it. It should be noted that the discreteness of energy does 
not mean in any case that the electron is “doomed” to remain forever in 
the initial energy state (for example on level E,) . The electron may go 
over to another energy state (level E; or any other) by acquiring or re- 
leasing the corresponding amount of energy. Such a transition is called a 
quantum transition. The quantum-mechanical idea of discreteness has a 
fairly long history. By the end of 19" century, it was established that the 
radiation spectra of free atoms are line spectra (i.e. they consist of sets of 
lines) and contain, for every element, definite lines which form ordered 
groups (series). In 1885, it was discovered that atomic hydrogen emits 
radiation of frequencies wn (henceforth we use the cyclic frequencies w, 
related to the normal frequencies v through the equation w = 27v) which 


may be described by the formula 


7 1 1 
On = 2mcR rae (2.1) 
where n are integral numbers 3, 4,5,..., c is the velocity of light, R is the 


so-called Rydberg constant (R = 1.097 x 10’/m). Formula (2.1) was derived 
by Balmer, hence the set of frequencies described by this formula is called 
the Balmer series. The frequencies of the Balmer series fall in the visible 


region of spectrum. Later (in the beginning of 20" 


century), additional 
series of radiation from atomic hydrogen falling in the ultraviolet and 
infrared regions were discovered. The regularities in the structure of these 
spectra were identical to the regularities in the structure of the Balmer 
series, which enabled a generalization of formula (2.1) in the following 
form: 
1 1 

On = 2mcR (a - =| (2.2) 
The number k fixes the series (in each series n > k);k = 2 gives the 
Balmer series, k = 1, the Lyman series (ultraviolet frequencies), k = 3, the 


Paschen series (infrared frequencies), and so on. 


Regularity in the structure of series was observed not only in the 
spectrum of atomic hydrogen, but also in the spectra of other atoms. It 
definitely indicated the possibility of some generalizations. One such 
generalization was proposed by Ritz in 1908 in his combination principle, 
which states that if the formulae of series are given and the constants 
occurring in them are known, any newly discovered line in the spectrum 
may be obtained from the lines already known by means of combinations 
in the form of sums and differences. This principle may be applied to 
hydrogen in the following way: we write the so-called spectral terms for 


different numbers n: 
2mcR 


n2 





T(n) = 


3A specific situation in quantum 
mechanics is possible in which one must 
assume that an electron occupies level E1 
as well as level E2 (see Section 10) 
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Then each frequency observed in the hydrogen spectrum may be ex- 
pressed as a combination of two spectral terms. By combining the spectral 


terms, it is possible to predict different frequencies. 


It is remarkable that at about the same time, the idea of discreteness 
arose in another direction (not related to atomic spectroscopy). This is the 
case of radiation within a closed volume or, in other words, black body 
radiation. After analyzing the experimental data, Planck in 1900 proposed 
his famous hypothesis. He suggested that the energy of electromagnetic 
radiation is emitted by the walls of a cavity not continuously, but in por- 


tions (quanta), the energy of a quantum being equal to 
E=how (2.3) 


where w is the frequency of radiation and fi is a certain universal constant 
(it later became known as Planck’s constant). Planck’s hypothesis pro- 
vided an agreement between the theory and experiment and, in particular, 
removed the flaws arising in the previous theory when passing to higher 
frequencies. This had been called the “ultraviolet catastrophe” (see, for 


example, Volume 3 of Savelyev’s General Physics).4 


In 1913, Bohr proposed his theory of the hydrogen atom. This theory 
was evolved as a “confluence” of the planetary atomic model by Ruther- 
ford, the Ritz combination principle, and Planck’s ideas of quantization of 


energy. 


According to Bohr’s theory, there exist certain states in which the 
atom does not radiate (stationary states). The energy of these states forms 
a discrete spectrum, E,, Ex,...,E,.... The atom emits (absorbs) during 
transition from one stationary state to another. The emitted (absorbed) 
energy is the difference between the energies of the corresponding sta- 
tionary states. Thus, during a transition from the state with energy En 
to the state with lower energy E, , a quantum of radiation with energy 
(En — Ex) is emitted. Thus a line with frequency 

_ En - Ex 


R (2.4) 


appears in the spectrum. Formula (2.4) expresses the well-known Bohr’s 


frequency condition. 


In Bohr’s theory, the n-th stationary state of hydrogen atom corre- 
sponds to a circular orbit of radius r, along which the electron revolves 
around the nucleus. In order to compute rn , Bohr suggested, firstly, using 
Newton’s second law for a charge moving in a circle under the influence 
of a Coulomb’s force: 


2 
(2.5a) 


3 

| 

Il 
h| S 


Savelyev, I. (1980). Physics - A General 
Course - Volume 3, volume 3. Mir Publish- 
ers, Moscow 


The Idea of Quantization and 
Bohr’s Model of Hydrogen 
Atom 
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(here m and e are the mass and the charge of an electron, vpn is the velocity 
of the electron in the n-th orbit). Secondly, Bohr suggested the condition 


of quantization of the angular momentum of the electron: 
MUnrn = nh (2.5b) 
By using relations (2.5a) and (2.5b), it is easy to find rp and vp: 
W\ 
Fn = Fa n 


e? 


~ jin 


(2.6) 


Un 


The energy En of the stationary state consists of kinetic (Tp) and potential 
(Un) terms: 
En = Tn + Un 


mv? e? 
Assuming that T, = oe U, = —— and using (2.6), we find that 


n 





met 


Be 2h? n2 


(2.7) 


The negative sign of the energy means that the electron is in a bound 


state (energy of a free electron is taken to be equal to zero). 


Substituting the result (2.7) into the frequency relation (2.4), and com- 
paring the expression thus obtained with formula (2.2), we may, following 


Bohr, find an expression for Rydberg’s constant: 


mef 


= ——_— 2.8 
4rch? 28) 


Bohr’s theory (or the old quantum theory, as it is now called) suffered 
from internal contradictions; in order to determine the radius of the orbit, 
one had to make use of relations of different kinds - the classical relation 
(2.5a), and the quantum relation (2.5b). In spite of this, the theory was 

of great significance as a first step towards the creation of a consistent 
quantum theory. Moreover, the nature of the spectral terms, and, conse- 
quently, the Ritz combination principle, was revealed for the first time 

and the calculated value of Rydberg’s constant was in excellent agreement 
with its empirical value. The success of the theory proved testimony to 
the usefulness of the idea of quantization. Having acquainted himself with 


Bohr’s calculations, Sommerfeld wrote Bohr a letter, in which he said: 


I thank you very much for sending me your extremely interesting work ... 

The problem of expressing the Rydberg-Ritz constant by Planck’s has been 
for some time in my thoughts... Although | am for the present still rather 
sceptical about atom models in general, nevertheless the calculation of the 


constant is indisputably a great achievement. 


We must note that in contrast to energy, the angular momentum of a On Quantization of Angular 


Momentum 
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microparticle is always quantized. Thus, the observed values of the square 


of angular momentum of a microparticle are expressed by the formula 
M? = h71(1 + 1), (2.9a) 


where l is an integer 0, 1, 2,.... If we consider the angular momentum of 
an electron in the atom in the n-th stationary state, the number / assumes 


values from 0 ton — 1. 


In the literature, it is customary to refer to the angular momentum as 


simply momentum, Henceforth, we shall follow this practice. 


The projection of the momentum of a microparticle in a certain direc- 


tion (let us denote it as z-direction) assumes the values 
Mz =hm (2.9b) 


where m = —1,—-1+1,...,/-1,1. For a given value of the number I, the num- 
ber m can assume 2l + 1 discrete values. We emphasize here that different 
projections of the momentum of a microparticle in a given direction differ 


from one another by values which are multiples of Planck’s constant. 


It was mentioned above that spin is a distinctive “internal” momentum 
of a microparticle having a definite value for a given microparticle. To 
distinguish it from the spin momentum, ordinary momentum is called 
orbital momentum. Kinematically the spin momentum is analogous, to 
the orbital momentum. Naturally, in order to find the possible projections 
of the spin momentum we must use a formula of the type (2.9b) (as in 
the case of orbital momentum, the projections of the spin momentum 
differ from one another by integral multiples of Planck’s constant). If s 
is the spin of a microparticle (this number was introduced in Section 1), 
then the projection of the spin momentum assumes values fio, where o = 
=s, —s+1, ...s—1, s. Thus, the projection of the spin of an electron assumes 
values = and +—. The numbers n, l, m, o considered here determine the 
different discrete values of the quantized dynamic variables (in this case, 
energy and momentum), and are called quantum numbers; n is called 
the principal quantum number; I, the orbital quantum number; m, the 
magnetic quantum number and o, the spin quantum number. There also 


exist other quantum numbers. 


33 


In spite of the resounding success of Bohr’s theory, the idea of quan- Anomalies of Quantum Transi- 


tization engendered serious doubts in the beginning. It was noticed that tions 


the idea was full of internal contradictions. Thus in his letter to Bohr, 


Rutherford wrote in 19137: 5 Bohr, N. (1961). Reminiscences of the 
founder of nuclear science and of some 


developments based on his work (the 
rutherford memorial lecture, 1958). 
ingenious and seem to work out well; but the mixture of Planck’s ideas Phys. Soc., 78:1083-1115 


.. Your ideas as to the mode of origin of the spectrum of hydrogen are very 


Proc. 


34 BASIC CONCEPTS OF QUANTUM MECHANICS 


with the old mechanics makes it very difficult to form a physical idea of 
what is the basis of it. There appears to me one grave difficulty in your 
hypothesis which I have no doubt you fully realise namely, how does an 
electron decide what frequency it is going to vibrate at when it passes from 
one stationary state to the other? It seems to me that you would have to 


assume that the electron knows beforehand where it is going to stop®... 


We shall explain the difficulties noticed by Rutherford. Let an electron oc- 
cupy level E; (Figure 2.1). In order to go over to the level Ez, the electron 
must absorb a quantum of radiation (i.e, a photon) with a definite energy 
equal to (Ez — E1). Absorption of a photon with any other energy will not 
result in the indicated transition and is therefore not possible (for simplic- 
ity, we shall consider only two levels). The question now arises: In what 
way does an electron perform a “selection” of the “required” photon out 
of the photon flux of different energies falling on it? In order to “select” 
the “required” photon, the electron must be “previously aware” of the sec- 
ond level, i.e. as if it had already visited it. However, in order to visit the 
second level, the electron must have first absorbed the “required” photon. 


This gives rise to a vicious circle. 


©The reader should not be confused 
by the remarks about the oscillations of 
electron: uniform motion in a circle is a 
superposition of two harmonic oscillations 
in mutually perpendicular directions. 


Further contradictions are observed while considering the jump of an electron from one orbit in the 


atom to another. Whatever the speed at which the transition of the electron from the orbit of one 


radius to that of another takes place, it has to last for some finite period of time (otherwise it would he 


a violation of the basic requirements of the theory of relativity). But then it is hard to understand what 


the energy of the electron should be during this intermediate period - the electron no longer occupies 


the orbit corresponding to energy E; and has not yet arrived at the orbit corresponding to energy Ep. 


It is thus not surprising that at one time efforts were made to obain 
an explanation of experimental results without resorting to the idea of 
quantization. In this respect, the famous remarks by Schrödinger about 
“these damned quantum jumps”, which, of course, were made in the heat 


of the moment, are worth noting. 


However, experience inevitably pointed to the usefulness of quantiza- 


tion and no place was left for an alternative. 


In this case, there is just one way out: new ideas must be introduced, 
which form a non-contradictory picture of the whole including the ideas 
of discreteness. The idea of wave-particle duality was just such a new 


physical concept. 


Classical physics acquaints us with two types of motion: corpuscular 
and wave motion. The first type is characterized by a localization of the 
object in space and the existence of a definite trajectory of its motion. The 
second type, on the contrary, is characterized by delocalization in space. 


No localized object corresponds to the motion of a wave, it is the motion 


Idea of Wave-Particle Duality 
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of a medium. In the world of macrophenomena, the corpuscular and wave 
motions are clearly distinguished. The motion of a stone thrown upward 
is something entirely different from the motion of a wave breaking a 


beach. 


These usual concepts, however, cannot be transferred to quantum 
mechanics. In the world of microparticles, the above-mentioned strict 
demarcation between the two types of motion is considerably obliterated. 
The motion of a microparticle is characterized simultaneously by wave 
and corpuscular properties. If we schematically consider the classical par- 
ticles and classical waves as two extreme cases of the motion of matter, 
microparticles must occupy in this scheme a place somewhere in between. 
They are not “purely” (in the classical sense) corpuscular, and at the same 
time they are not “purely” wavelike; they are something qualitatively 
different. It may be said that a microparticle to some extent is akin to a 
corpuscle, and in some respect it is like a wave. Moreover, the extent de- 
pends, in particular, on the conditions under which the microparticle is 
considered. While in classical physics a corpuscle and a wave are two mu- 
tually exclusive extremities (either particle, or wave), these extremities, at 
the level of microphenomena, combine dialectically within the framework 


of a single microparticle. This is known as wave-particle duality. 


The idea of duality was first applied to electromagnetic radiation. As 
early as 1917, Einstein suggested that quanta of radiation, introduced by 
Planck, should be considered as particles possessing not only a definite 


energy, but also a definite momentum: 


E=ho 
ho (2.10) 
PSr 


Later (from 1923), these particles became known as photons. 


The corpuscular properties of radiation were very clearly demonstrated in the Compton effect (1923). 
Suppose a beam of X-rays is scattered by atoms of matter. According to classical concepts, the scat- 
tered rays should have the same wavelength as the incident rays. However, experiment shows that 
the wavelength of scattered waves was greater than the initial wavelength of the rays. Moreover, the 
difference between the wavelengths depends on the angle of scattering. The Compton effect was 
explained by assuming that the X-ray beam behaves like a flux of photons which undergo elastic col- 
lisions with the electrons of the atoms, in conformity with the laws of conservation of energy and 
momentum for colliding particles. This led not only to a qualitative but also to a quantitative agree- 


ment with experiment (see Savelyev”). 


7 
In 1924, de Broglie suggested that the idea of duality should be ex- Savelyev, l:(1980); “Physics = A General 
Course - Volume 3, volume 3. Mir Publish- 

tended not only to radiation but also to all microparticles. He proposed ers, Moscow 


to associate with every microparticle corpuscular characteristics (energy E 


and momentum p) on the one hand and wave characteristics (frequency w 
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and wavelength 4) on the other hand. The mutual dependence between 
the characteristics of different kinds was accomplished, according to de 


Broglie, through the Planck’s constant in the following way: 


E=ho 
onh (2.11) 
P= 


(the second relation is known as de Broglie’s equation). For photons, re- 

; A i ‘ 5 : TC., 
lation (2.11) is automatically satisfied if we substitute w = FE in (2.10). 
The boldness of de Broglie’s hypothesis lays in that relation (2.11) was 
assumed to be satisfied not only for photons, but generally for all mi- 
croparticles, and in particular, for those which have a rest mass and which 


were hitherto associated with corpuscles. 


De Broglie’s ideas received confirmation in 1927, with the discovery 
of electron diffraction. While studying the passage of electrons through 
thin foils, Davisson and Germer (as well as Tartakovsky) observed char- 
acteristic diffraction rings on the detector screen. For “electron waves” 
the crystal lattice of the target served as a diffraction grating. Measure- 
ment of distances between diffraction rings for electrons of a given energy 


confirmed de Broglie’s formula. 


In 1949, Fabrikant and coworkers set up an interesting experiment. 
They passed an extremely weak electron beam through the diffraction 
apparatus. The interval between successive acts of passage (between two 
electrons) was more than 1 x 10* times longer than the time required for 
the passage of an electron through the apparatus. This ensured that other 
electrons of the beam do not influence the behaviour of an electron. The 
experiment showed that for a prolonged exposure, permitting registration 
of a large number of electrons on the detector screen, the same diffraction 
pattern was observed as in the case of regular electron beams. It was thus 
concluded that the wave nature of the electrons cannot be explained as 
an effect of the electron aggregate; every single electron possesses wave 


properties. 


The idea of quantization introduces discreteness, and discreteness 
requires a unit of measure. Planck’s constant plays the role of such a 
measure. It may be said that this constant determines the “boundary” 
between microphenomena and macrophenomena. By using Planck’s 
constant, as well as mass and charge of an electron, we may form the 
following simple composition having dimensions of length: 

2 


h 
rı = — =0.53x 10° cm (2.12) 
me 


(note that r; is the radius of the first Bohr orbit). According to (2.12), a 


The Role of Planck’s Constant 
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magnitude of about 1 x 1078 cm may be considered as the spatial “bound- 
ary” of microphenomena. This is just about the linear dimensions of an 


atom. 


If the Planck constant f were, say, 100 times larger, then (other con- 
ditions being equal) the “limit” of microphenomena would, according to 
(2.12), have been of the order of 1 x 10~ cm. This would mean that the 
microphenomena would become much closer to us, to our scale, and the 
atoms would have been much bigger. In other words, matter in this case 
would have appeared much “coarser”, and classical concepts would have 


to be revised on a much larger scale. 


As was indicated above, the projections of the momentum of a mi- 
croparticle differ from one another by multiples of fi [see (2.9b)]. Conse- 
quently, Planck’s constant appears here as a unit of quantization. If the 
orbital momentum is much greater than h, its quantization may be ne- 
glected. We get in this case the classical angular momentum. In contrast 
to the orbital momentum, spin momentum cannot be very large. It is clear, 
that it is impossible to neglect its quantization in principle; hence the spin 
momentum does not have a classical analogue (this circumstance was 


already indicated in Section 1). 


Planck’s constant is inseparably linked not only with the idea of quanti- 
zation, but also with the idea of duality. From (2.11) it is evident that this 
constant plays a fairly important role - it supplies a “link” between the 
corpuscular and wave properties of a microparticle. This becomes quite 
clear if we rewrite (2.11) in a form permitting us to take account of the 


vector nature of momentum: 


E = hw 
A (2.13) 
p=ħk 


Here k is the wave vector; its direction coincides with the direction of 
propagation of the wave, and its magnitude is expressed through the 
wavelength in the following way: 


k = — 
À 


The left-hand sides of equations (2.13) describe corpuscular properties of a 
microparticle, and the right-hand sides wave properties. We note, by the 
way, that the form of relations (2.13) indicates the relativistic invariance of 


the idea of duality. 


Thus, Planck’s constant plays two fundamental roles in quantum me- 
chanics - it serves as a measure of discreteness, and it combines the cor- 
puscular and wave aspects of the motion of matter. The fact that the same 


constant plays both these roles is an indirect indication of the internal 
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unity of the two fundamental ideas of quantum mechanics. 


In conclusion, we remark that the presence of Planck’s constant in any 


expression indicates the “quantum-mechanical nature” of this expression.® 


3 Uncertainty Relations 


Let us consider an aggregate of a large number of plane waves (the 
nature of waves is not important) propagating, say, along the x-axis. Let 
the frequencies of the waves be “spread” over a certain interval Aw, and 
the values of the wave vector, over an interval Ak,. If all these plane 
waves are superimposed on one another, we get a wave formation limited 
in space called a wave packet (Figure 3.1). The spreading of the wave 


packet in space (Ax) and in time (At) is determined by the relations 


Aw At 2 1 


(3.1) 
Ak, Ax > 1 


These relations are well known in classical physics. Those acquainted 
with radio engineering know that for a more localized signal one must 
take more plane waves with different frequencies. In other words, to 


reduce Ax and At, one must increase Ak, and Aw. 


Digressing from the wave packet, we shall formally assume that rela- 
tions (3.1) are valid not only for classical waves, but also for wave charac- 
teristics of a microparticle. We stress that this assumption by no means 
indicates that we shall in fact model a microparticle in the form of a wave 
packet. By considering w and k, in (3.1) as wave characteristics of a mi- 
croparticle and making use of relations (2.13), it is easy to go over to an 
analogous expression for the corpuscular characteristics of a microparticle 


(for its energy and momentum): 


AE At > h (3.2) 
h 


Apx Ax > (3.3) 


These relations were first introduced by Heisenberg in 1927 and are called 


uncertainty relations. 


Relations (3.2) and (3.3) should be supplemented by the following 
uncertainty relation: 
AM, Agx > h (3.4) 


where Ag, is the uncertainty in the angular coordinates of the microparti- 
cle (we consider rotation around the x-axis) and AM, is the uncertainty in 


the projection of the momentum on the x-axis.” 


By analogy with (3.3) and (3.4), one may write down relations for other 


8The converse statement is not true. It 
would be incorrect to attempt, as is some- 
times done, to reduce the whole “essence” 
or quantum mechanics to the presence 
of Planck’s constant. This question is 
considered in Podgretsky. 

Podgretsky, M. 1. (1976). Complemen- 
tarity Principle and Materialistic Dialectics, 
chapter Planck’s constant ñ and the com- 
plementarity principle. Nauka, Moscow 


Idea of Duality and Uncer- 


tainty Relations 


Ax 


Figure 3.1: The wave packet. 


Notice that relations (3.4) and (3.4a) 
are valid only for small values of the un- 
certainty in angular coordinate (Ag « 27) 
or, in other words, for large values of 
uncertainty in the projection of the 
momentum. 
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projections of momentum and angular momentum: 


Apy Ay > A, Apz Az > h, (3.3a) 
>h 


AM, A@y > ħ, AM, Ag; > hh (3.4a) 


Let us consider relation (3.3). Here Ax is the uncertainty in the x-coordinate The Meaning of the Uncer- 
of the microparticle and Ap, the uncertainty in the x-projection of its mo- tainty Relations 

mentum. The smaller Ax is, the greater Ap, is, and vice versa. If the mi- 

croparticle is localized at a certain definite point x, then the x-projection 

of its momentum must have arbitrarily large uncertainty. If, on the con- 

trary, the microparticle is in a state with a definite value of px, then it 


cannot be localized exactly on the x-axis. 


Sometimes the uncertainty relation (3.3) is interpreted in the following 
way: it is impossible to measure simultaneously the coordinate and mo- 
mentum of a microparticle with an arbitrarily high precision; the more 
accurately we measure the coordinate, the less accurately can the mo- 
mentum be determined. Such an interpretation is not very good since it 
might lead to the erroneous conclusion that the essense of the uncertainty 
relation (3.3) is responsible for limitations associated with the process 
of measurement. One might be led to assume that a microparticle itself 
possesses a definite coordinate as well as a definite momentum, but the 


uncertainty relation does not permit us to measure them simultaneously. 


Actually the situation is quite different. The microparticle itself simply 
cannot have simultaneously a definite coordinate and a corresponding 
definite projection of the momentum. If, for example, it is in a state with a 
more definite value of the coordinate, then in this state the corresponding 
projection of its momentum is less definite. From this the actual impos- 
sibility of simultaneous measurements of coordinates and momenta of a 
microparticle follows naturally. This is a result of the specific character 
of the microparticle and is by no means a whim of nature which makes 
it impossible for us to perceive all that exists. Consequently, the sense of 
relation (3.3) is not that it creates certain obstacles to the understanding of 
microphenomena, but that it reflects certain peculiarities of the objective 
properties of a microparticle. The last remark is, of course, of a general 
nature: it refers not only to relation (3.3), but also to other uncertainty 
relations. Now let us look at relation (3.2). Let us consider two different, 
though mutually supporting interpretations, of this relation. Suppose that 
the microparticle is unstable and that At is its lifetime in the state under 
consideration. The energy of the microparticle in this state must have an 
uncertainty AE which is related to the lifetime At through inequality (3.2). 
In particular, if the state is stationary (At is arbitrarily large), the energy 


of the microparticle will be precisely determined (AE = 0). 


The other interpretation of relation (3.2) is connected with the mea- 
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surements carried out to ascertain whether the microparticle is located 
at the level E; or Ez. Such a measurement requires a finite time T which 


depends on the distance between the levels (E; — Ej) : 
(E2 = E,)T > h (3.2a) 


It is not difficult to see the connection between these two interpretations. 
In order to distinguish the levels E; and Ep, it is necessary that the uncer- 
tainty AE in the energy of the microparticle should not be greater than 


the distance between the levels: 
AE < (E; — E1). 


At the same time the duration of measurement T should obviously not 


exceed the lifetime At of the microparticle in the given state: 
T<At 


Consequently, the limiting conditions, under which measurement is still 


possible, are given by 
AE x E; - Fi, T = At 
By using (3.2), we can arrive at (3.2a) from these relations. 


The uncertainty relations (3.2)-(3.4) show how the concepts of energy, 
momentum and angular momentum should be applied in the case of mi- 
croparticles. Here, a very important peculiarity of the physics of micropar- 
ticles is revealed: the energy, momentum and the angular momentum 
of a microparticle have meaning only within the limitations imposed by 


the uncertainty relations. Heisenberg!” writes that we cannot interpret 10 Heisenberg, W. (1958). Die planck- 
ische entdeckung und die philosophischen 

grundsfragen der atomlehre. Naturwis- 
If we make use of the usual concepts, their applicability is limited by the senschaften, 45:227-234 


processes on an atomic scale in the same way as processes on a large scale. 


uncertainty relations. 


It should, however, be pointed out that the uncertainty relations do 
not in any way lead to the above-mentioned restrictions on the applica- 
bility of classical concepts of coordinates, momentum, energy, etc. for 
microparticles. It would be unfair not to mention the considerable “pos- 
itive aspects” of uncertainty relations after having talked about their 
“negative aspects”. They serve as a working instrument of the quantum 
theory. By reflecting the specific character of the physics of microparti- 
cles, the uncertainty relations allow us to obtain fairly important results 


through fairly simple means. Some examples are given in Section 4 below. 


The method of deriving the uncertainty relations considered in the be- From Diffraction in Microparti- 
ginning of this section might appear too formal and unconvincing to some cles to Uncertainty Relations 


readers. There are various means of deriving uncertainty relations (see, 
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for example Heisenberg (1930)'"). One such method [which is specifically 
applied to relations (3.3)] is based on a consideration of the phenomena of 


diffraction of microparticles. 


Suppose (Figure 3.2) a screen with a narrow slit is placed in the path 
of a strictly parallel beam of certain microparticles with momentum p. 
Let d be the width of the slit along the x-axis (the x-axis is perpendicular 
to the direction of the beam). Diffraction takes place during the passage 
of microparticles through the slit. Let 0 be the angle between the initial 
direction and the direction of the first (principal) diffraction peak. The 
classical wave theory gives the following well-known relation for this 
angle: 


sin 0 = F 


Assuming angle 0 to be sufficiently small, we can rewrite this relation in 
the following form: 


(3.5) 


D 
x 
al> 


If by A we now mean not the classical wavelength, but the length of the 
de Broglie wave (i.e. the wave characteristic of the microparticle, we may 
rewrite relation (3.5) in “corpuscular language” by using the expression 


(2.11): 
h 


N a (3.5a) 
But how we are to understand the existence of the angle 0 in “corpuscular 
language”? Obviously, it means that while passing through the slit, the 
microparticle acquires a certain momentum Ap, in the direction of the 


x-axis. It is easy to see that Ap, ~ p0. Substituting (3.5a) into this, we get 


al> 


Apx © 


By considering the quantity d as the uncertainty Ax in the x-coordinate of 


the microparticle passing through the slit, we get 
Apx Ax = h, 


i.e. we arrive at the uncertainty relation (3.3). Thus the attempt to de- 
termine in some way the coordinate of a microparticle in a direction 
perpendicular to the direction of its motion leads to an uncertainty in the 
momentum of the microparticle in that direction, which also explains the 


phenomenon of diffraction observed in the experiment. 


In order to describe the state of a classical object it is necessary to 
give a definite set of numbers - the coordinates and the velocity compo- 
nents. In doing this other quantities, in particular, energy, momentum 
and angular momentum of the object will also be determined [see (1.1)] 


The uncertainty relations show that this method of defining a state is 


z Heisenberg, W. (1930). The Physical 
Principles of the Quantum Theory. The Uni- 
versity of Chicago Press, Chicago 
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Figure 3.2: Diffraction of microparticles 
from a slit. 


Uncertainty Relations and the 
State of Microparticles. The 
Concept of a Complete Set of 
Physical Quantities 
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not applicable to microparticles. Thus, for example, the existence of a 
definite projection of momentum in a given direction for a microparticle 
means that the position of the microparticle in this direction cannot be 
determined unambiguously: according to (3.3), the corresponding spatial 
coordinate is characterized by an infinitely large uncertainty. The electron 
in an atom has a definite energy; moreover its coordinates are character- 
ized by an uncertainty of the order of the linear dimensions of the atom. 
This [according to (3.3)] leads to an uncertainty in the projection of the 
momentum of the electron equal to the ratio of Planck’s constant to the 


linear dimension of the atom. 


We now indicate the following situations, fundamental in quantum 


mechanics, which arise from the uncertainty relation: 


(a) various dynamic variables of a microparticle are combined in 
sets of simultaneously determined (simultaneously measurable) 


quantities, the so-called complete sets of quantities; 


(b) various states of a microparticle are combined in groups of states 


corresponding to different complete sets of quantities. 


Each group contains the states of the microparticle in which the values of 
the corresponding complete sets are known (it is customary to say that 


every complete set has its own method of defining its states). 


We shall give examples of the complete sets employed for determining 
the states of, say, an electron and a photon. Each of the sets includes four 
quantities (because of this we say that a microparticle like an electron or a 
photon has four degrees of freedom). To describe the states of an electron, 


the following sets are employed: 


xX, Y, Z, 0 (3.6a) 
Px Py Pz oO (3.6b) 
E, l, m, o (3.6c) 


(remember that I, m, and o are orbital, magnetic and spin quantum num- 
bers, respectively). We emphasize that the coordinates and the momen- 
tum components of a microparticle (in this case an electron) fall in differ- 
ent complete sets of quantities; these two physical quantities cannot be 
measured simultaneously. Hence the classical relations (1.2) and (1.3) are 
not valid when going over to microparticles, since each of these relations 


contains the coordinates as well as the momentum. 


The set (3.6b) is used, in particular, for a describing the states of a 
freely moving electron. Moreover, the energy of the electron also turns 
out to be definable!?: 1274 contrast to relation (1.2), the clas- 


sical relation E = 2- remains valid for a 


m 
freely moving particle when going over to 
microparticles. 
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2m 
The set (3.6c) is usually employed for describing the states of an electron 
in the atom. To describe the states of a photon, the following sets are most 


commonly employed: 


kx, ky, kz, a (3.7a) 
E, M?, Mz, P (3.7b) 


Here kx, ky, kz are the projections of the wave vector of the radiation; æ 
is the polarization of the photon; M” and M; are the square of the momen- 
tum and the projection of the momentum of the photon, respectively; P is 
a quantum number called the spatial parity. We notice that as soon as the 
projections of the wave vector of radiation are determined, the projections 
of the photon momentum are also known (recall that p = hk ). The polar- 
ization of a photon may take two values corresponding completely to the 
two independent polarizations of a classical wave (thus, for example, one 


might talk about a photon having right elliptical polarization). 


The spatial parity is a characteristic property of a microparticle; it may 
be considered as an integral of motion whose conservation is a result of 
symmetry with respect to the operation of reflection by a mirror. At a 
later stage (see Section 20) we shall discuss it in greater detail. Here, we 


only mention that parity can assume one of the two values: +1 and -1. 


The set (3.7a) is used for describing the states of photons correspond- 
ing to plane classical waves, in this case the energy of the photon is also 
defined (recall that E = hw = fick). The states described by the set (3.7a) 
are called ka-states. The set (3.7b) is employed for describing the states 
of photons belonging to spherical classical waves. We note that just as 
a spherical wave may be represented as a superposition of plane waves, 
the states of a photon described by the set (3.7b) may be represented as 
a “superposition” of states described by the set (3.7a). The converse state- 
ment regarding the representation of plane waves as a superposition of 
spherical waves is also true. Here we have touched upon (for the present 
just touched upon) one of the most important and delicate aspects of the 
quantum-mechanical description of matter - the specific character of the 
“interrelations” between states of a microparticle described by different 
complete sets. This specific character is reflected in the most constructive 
principle of quantum mechanics - the principle of superposition of states. 
The superposition of states will be considered in detail in the second chap- 


ter; here we shall just restrict ourselves to the above-mentioned remarks. 
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The main contradiction regarding quantum transitions indicated in Sec- The Uncertainty Relations and 


tion 2 is essentially overcome by making use of the idea of duality or, Quantum Transitions 
more precisely, the uncertainty relation (3.2). Let us consider transition of 


an electron in an atom from level E; to level E2 by absorbing a photon of 
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energy fiw = E; — E;. We recall that the contradiction in transition was 
connected with the question whether the absorption of the photon pre- 
cedes the transition of the electron or vice versa. It is easy to see that this 
question simply loses its meaning now. In fact, if we have a bound elec- 
tron with energies E, and E; before and after interaction with radiation, 
respectively, then during the interaction we have one quantum-mechanical 
system including both the electron and the radiation. This system exists 
for a definite time (while the interaction with the radiation takes place) 
and, according to (3.2), cannot have any definite energy. Hence it is mean- 
ingless to find out precisely what takes place in such a system. Strictly 
speaking, during the interaction of the electron with the photon there is 
no electron and no photon, but a single entity which must be treated as 
such, without going into details. This example shows that in quantum 
mechanics a physical process cannot be infinitely detailed in time. The 
question “what follows what”? cannot always be posed in the case of 


: 13 
microphenomena. 


13The anomaly of quantum transition 
is completely removed by considering the 
principle of superposition of states (see 
Section 10). 


The uncertainty relation (3.2) allows us to introduce and employ a very important concept in quan- 


tum theory, the so-called virtual transitions, for explaining quantum transitions. We shall give here a 


simplified treatment of virtual transitions, but we shall give a detailed explanation later in Section 6. 


According to relation (3.2), an electron may go over from level E; to Ez without getting any energy 


from outside; what is important is that it should quickly return to its initial level E1. Such a “journey” 
(E1 — E2 — E1) is possible if its duration At is such that the inequality i/At >> (E2 — E1) is satisfied, 
because in this case the uncertainty in the energy of the electron is greater than the difference in the 
energies of the levels under consideration. Hence it is clear that the statement “the electron occupies 
level E,;” may be understood quite specifically - as incessant “transition” of the electron from the given 
state to others with an inevitable return every time to the starting level Ey. Such transitions cannot he 
observed experimentally, and are called virtual transitions in contrast to the normal (real) transitions. 
During interaction of an electron undergoing virtual transitions with radiation, the electron is liable 
to change its “residence”. For example, it might now occupy level Ez and will in future perform virtual 
transitions not from level E4, but from level E2. If such a thing happens, the electron is said to have 
absorbed a photon of energy fiw = Ez — E1, and undergone a transition from level E; to E2. Virtual 
transitions don’t require any expenditure of energy from outside while a real transition cannot occur 
without expenditure of energy - the energy of the photons absorbed (or emitted) by electrons during 


interaction with radiation. 


To explain the difference between real and virtual transitions, we note that a real transition from a 
level E; to another level Ez and back may be broken up into two successive events in time (in between 
the electron may be experimentally registered in the intermediate state E2). However, the virtual 
transition from level E; to Ez and back cannot be broken up into two events in time - both parts of the 


transition must be considered as a single, indivisible process in time. 


The uncertainty relations used in quantum theory are by no means ex- Uncertainty Relation “Number 


hausted by relations (3.2)-(3.4). As an example of one more such relation of Photons-Phase” 


we consider the uncertainty relations for the number of photons and the 
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phase of the wave. 


Let there be a monochromatic radiation of frequency w. On one hand, 
it may be considered as a collective of photons each having energy fiw; on 
the other hand, we might treat it as a classical electromagnetic wave. Let 
N be the number of photons in the volume under consideration and let 
® = wt be the phase of the classical wave. The corpuscular characteristic 
of the radiation (number of photons N) and its wave characteristic (phase 
® cannot have definite values simultaneously; there exists the uncertainty 
relation 

ANA® > 1 (3.8) 


In order to arrive at relation (3.8), we start from the uncertainty relation 
for energy and time. We recall that for measuring the energy of a quan- 
tum object with an accuracy AE we must spend time At > RE’ If we take 


the collective of photons as the quantum object, we get 
AE = hwAN 


where AN is the uncertainty in the number of photons. During the time 
At necessary for measurement of the energy of the object with an accu- 


racy hwAN, the phase ® changes by A® = wAt. Substituting into this the 


h 
relation At > ———, we find 
hoAN 


1 


A® > — 
AN 


QED. 


Relation (3.8) reflects the dialectically contradictory unity of corpuscular 
and wave properties of radiation. The uncertainty A® is small when the 
wave properties of radiation are clearly exhibited; in this case the density 
of photons is high (N is large) and so is the uncertainty AN. On the other 
hand, the uncertainty AN is small when there are a few photons in the 
aggregate. In this case the corpuscular properties of radiation are clearly 


exhibited and therefore the uncertainty A® is large. 


4 Some Results Ensuing from the Uncertainty Relations 


The uncertainty relations serve as very useful working tools of quan- Evaluation of Energy of 
tum theory since they permit one to obtain important results, by fairly Ground State of Hydrogen 
simple means. Atom 


As an example, we consider the hydrogen atom in its ground state. We 
make use of the well-known classical expression for energy of a charged 


particle moving in a Coulomb field: 


2 2 
pet t (4.1) 
2m r 


where m and e are the mass and charge of the electron, respectively. In 
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order to use the classical expression (4.1) in the quantum theory, we con- 
sider the quantities p and r occurring in it as uncertainties in momentum 
and coordinates of the electron, respectively. According to relation (3.3), 
these quantities are connected with each other. We assume pr ~ hi, or, 
simply, 

pr=h (4.2) 


Eliminating the quantity r from (4.1) by using (4.2), we get 


2 2 
P _ eP 
E = — - — 4.3 
(=F - 5 (43) 
It is easy to see that the function E(p) has a minimum for a certain value 
p = pi. We denote it as E,. The quantity E; may be considered as the 


energy of the ground state of the hydrogen atom while the quantity rı = 


— is the estimate of the linear dimensions of the atom (in Bohr’s theory 
Pı 


d 
this is the radius of the first orbit). By equating the derivative y (p) to 


2 
me 
zero, we find pı = z This at once gives the required evaluations (cf. 


(2.6) and (2.7)]: 


he 
ri = 3 
pr 4 (4.4) 
E1 = -—~ 
2h2 


The values given by (4.4) fully coincide with the results of the rigorous 
theory.!* Of course such a complete coincidence must be considered to 
some extent as an accidental success. Only the order of the quantities 
should be taken seriously here. We emphasize that this order can be 
evaluated quite simply as follows: it is sufficient first to simply replace the 
precise values of the dynamic variables in expression (4.1) by quantities 
which characterize the degree of “blurring” of these variables, i.e. by their 
uncertainties, and then use the quantum-mechanical relations connecting 


the said uncertainties. 


We shall proceed exactly in the same way as in the preceding example. 
The energy of a classical one-dimensional harmonic oscillator is given by 


the expression 


p mox? 


2m 2 


E (4.5) 


Treating px and x as uncertainties in the momentum and coordinate of 
the oscillating microparticle and using the equality py - x = has the 
uncertainty relation, we get from (4.5) 

p mok 


E(px) = om + 2p2 





(4.6) 


d 
By equating the derivative ere to zero we find the value of pọ = 


+Vmho for which the function E(p,) assumes its minimum value. It is 


14n the rigorous theory the quantity 
rı is a characteristic for the ground state 
of the hydrogen atom and denotes the 
distance from the nucleus at which an 
electron is most likely to be observed (see 
expression (5.4)). 


Estimate of the Energy of 
Zero-point Oscillations of an 


Oscillator 
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easy to see that this value is 
Eo = E(po) = ho (4.7) 


This result is quite interesting. It shows that in quantum mechanics the 
energy of an oscillator cannot vanish; its minimum value is of the order of 
hw. This is the energy of what is called the zero-point oscillation. We note 
that the estimate (4.7) differs from the exact expression for the energy of 


the zero-point oscillation just by a factor of 7 (exact value Ey = z"o). 


By taking the zero-point oscillations into account, one may arrive at 
the following interesting conclusion: the energy of the oscillatory motion 


of atoms in a crystal does not vanish even at absolute zero. 


The zero-point oscillations illustrate a basically general circumstance: 
it is impossible to find a microparticle at the “bottom of the potential 
well”, or, in other words “a microparticle cannot fall to the bottom of the 
potential well” This conclusion does not depend on the form of the po- 
tential well, since it is a direct consequence of the uncertainty relation: 
“falling to the bottom of the well” is connected with the vanishing of mo- 
mentum and hence the uncertainty in the momentum of the microparticle. 
In this case, the uncertainty in the coordinate becomes so large that there 
is a direct contradiction with the very fact that the microparticle is in a 


potential well. 


The essence of the effect, investigated in 1958 by Keldysh, and inde- 
pendently by Franz, lies in the following: in a uniform external electric 
field, the minimum of the electron energy in the conduction band of semi- 
conductors shifts downwards on the energy scale, leading to a “blurring” 
of the edge of the fundamental optical absorption band (as a result the 
absorption of the photons with energies lower than the forbidden band 
width becomes possible).!° The value of the shift of electronic states char- 
acterizing the indicated “blurring” may be obtained in the same way as 
the preceding evaluations were made. We make use of the classical expres- 


sion for the energy of a charged particle in the electric field of intensity £: 


2 
E= 2 — ex (4.8) 
2m 


Here m is the effective mass of the electron in the conduction band. Treat- 
ing px and x as uncertainties in the momentum and coordinate of the 
electron and using the equality py - x =f. as the uncertainty relation, we 


get from (4.8) 


2 Beh 
Beds eS 





(4.9) 


d 
Next, as usual we equate the derivative Fp ex) to zero and obtain the 
Px 


Evaluation of the “Blurring” of 
the Optical Absorption Band 
Edge in the Franz-Keldysh 
Effect 


5 Vavilov, V. S. (1963). The Effect of Ra- 
diation on Semiconductors (in Russian). Fiz- 
matgiz, Moscow 
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value po = —V&ehm for which the function E(p,) assumes its minimum 


value: 


Ey 3 (@eh)’ 





NI% 


(4.10) 
3 (Eeh)? 


m 





Expression (4.10) gives an estimate of the extent of the “blurring” of the 


edge of the fundamental optical absorption band in the Franz-Keldysh 


effect. 
While postulating the stationary states, Bohr’s theory did not explain Why does not the Electron Fall 
why, after all, the electron, moving under acceleration, does not radiate into the Nucleus? 


and fall into the nucleus as a result of this. Relation (3.3) explains this 
fact. The falling of an electron into a nucleus would obviously mean a 
considerable reduction in the uncertainty of its coordinate. Before the 
hypothetical falling into the nucleus, the electron is localized within 

the limits of the atom, i.e. in a region of space with linear dimensions 

h? /me? =~ 1x 10° cm [see (4.4)], whereas after falling into the nucleus 
it would be localized in a region with linear dimensions less than 1 x 
107"? cm. According to (3.3) a stronger localization of a microparticle in 
space is linked with a “blurring” of its momentum. Hence upon falling 
into the nucleus, the mean value of the momentum of the electron must 
increase, which requires an expenditure of energy. Thus it turns out 
that effort has to be made not to “hold” the electron from falling into the 
nucleus, but on the contrary to “force” the electron to be localized within 


the nucleus. 


In the example of the zero-point oscillations it was pointed out that the 
microparticle in a potential well always possesses a non-zero minimum 
energy Eo. The magnitude of Ep depends, in particular, on the spatial 
dimensions of the well (or on its width a, which determines the extent 
of localization of the microparticle in space). By taking into account the 
uncertainty relation, it is easy to see that 

K2 


Eo x — 
ma? 


(4.11) 


If a decreases, Eo increases. For sufficiently small a, the energy Ey) may 
become greater than the depth of the potential well. It is obvious that 


such a well will not hold the microparticle at all. 


The falling of an electron into the nucleus corresponds to a decrease 
in the width of the potential well from 1 x 1078 cm to 1 x 10°” cm (and 
even lower). According to (4.11) the minimum energy Ep should increase 


in this case from 10 eV to 1 x 10° eV (and higher). As a result the minimum 
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energy of the electron turns out to be a few orders higher than the bind- 
ing energy of a nucleon in the atomic nucleus (the latter being not greater 
than 1 x 10’ eV). This means that the electron cannot ever be present in 
the nuclear potential well and hence it can by any means be compelled 

to be localized within the nucleus, not even by force. This not only elimi- 
nates the problem of “an electron falling into the nucleus” but also solves 
another fundamental question: the electron is not one of the constituents of 


the atomic nucleus. 


In order to draw the trajectory of a particle, it is necessary, strictly On the “Trajectory” of Mi- 
speaking, to know the coordinate and momentum of the particle at every croparticles 
moment of time (in fact, in order to depict the dependence x(t), it is 
necessary to know, for every t, the values x and dx/dt). Since, according 
to the uncertainty relation (3.3), a microparticle cannot simultaneously 
possess a defInite coordinate and a definite projection of the momentum, 
one can draw the conclusion that the concept of trajectory in case of 


microparticle, strictly speaking, is not applicable. 


The rejection of the trajectory concept is connected with the existence 
of wave properties in microparticles, which do not permit one to consider 
microparticles as classical corpuscles. The movement of a microparticle 
along the x-axis cannot be associated with the differentiable function 
x(t) which is so widely used when considering the mechanics of classical 
objects. From a known value of x of the microparticle at a certain time t, 


it is impossible to predict the value of its coordinate at the time t + dt. 


As applied to Bohr’s theory, this means a rejection of the very concept 
of “the orbit of an electron in an atom”. One may speak about the localiza- 
tion of the electron within the atom as a whole; the orbit requires a much 
greater spatial localization. Turning to the problem of “the electron falling 
into the nucleus” discussed above, we can understand the consequence 
to which such a localization leads. The planetary model of the atom thus 
turns out to be just an intermediate step in development of our concept of 
the atom. Much later, in the fifties, Bohr himself amusingly recalled how 
after one of his lectures a student came up and asked him “Were there 


really idiots who thought that the electron revolves in an orbit?” 


We note that with the rejection of the idea of orbits of the electron in an atom the contradiction re- 
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garding the problems of the instantaneous jump of the electron from one orbit into another, discussed 


in Section 2, is automatically eliminated. 


There are situations, however, in which one can make use of the idea 
of “the trajectory of a microparticle”. As an example we consider the 
motion of electrons in the kinescope of a television set. The momentum 


of the electron along the axis of the tube is p = V2meU, where U is the 
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accelerating voltage. The formation of the electron beam means a definite 
localization of the coordinate in the transverse direction. The degree of 
this localization is characterized by the diameter d of the beam. According 
to (3.3), there must be an uncertainty in the electron momentum in a 
direction perpendicular to the axis of the beam: Ap ~ fi/d. As a result 

of this uncertainty, the electron may deviate from the axis of the beam 
within an angle AO ~ Ap/p ~ h/pd. Let L be the path length of the 
electron in the kinescope. Then the uncertainty in the position of the 
point of impingement of an electron on the screen will be characterized 
by the quantity Ax ~ LA@ ~ Lh/pd. Assuming U = 20kV,d = 1 x 
10°-3cm,L = 1 X 10? cm, we get Ax ~ 1x 107” cm. In this way, the 
blurring of the point of impingement due to the uncertainty relation is 
considerably less than the diameter of the beam. It is clear that in such 


cases the motion of the electron may be treated classically. 


Suppose we have a potential barrier whose height U is greater than The Possibility of a Microparti- 
the energy of the particle (Figure 4.1). We ask the question: can a particle, cie Sub-Barrier Passage (Tun- 
situated somewhere to the left of the barrier, appear after some time to the neling Effect) 


right of it, without getting any energy from outside? Classical mechanics 
gives a negative answer — a classical corpuscle cannot pass through the 
barrier. If this were to happen, then the total energy of the particle, say, at 
the point A in Figure 4.1 would be less than its potential energy, which is 
physically absurd. 





Does this ban apply to microparticles as well? It can be shown that 
it does not - it is removed by relation (3.2). Let the microparticle move Figure 4.1: A microparticle tunneling 
from infinity to the right and encounter the potential barrier. Until this Pause 
encounter it was in a state of free motion for an infinitely long time and 
hence its energy had a definite value. But now the microparticle interacts 
with the barrier, or, more precisely, with the objects which caused the 
appearance of the barrier. Suppose the interaction lasts for a time At. Ac- 
cording to (3.2), the energy of the microparticle in a state of interaction 
with the barrier is no longer definite but is characterized by the uncer- 
tainty AE > h/At. If this uncertainty is of the order of the height U of 
the barrier, the latter stops being an unsurmountable obstacle for the mi- 
croparticle. Thus the microparticle may pass through the potential barrier. 
This specific quantum effect is called the tunneling effect. It explains, in 
particular, the a-decay of atomic nuclei. It should be noted that when con- 
sidering the tunneling effect, the motion of the microparticle cannot be 
represented by the dotted line in Figure 4.1. The dotted line corresponds 
to the classical trajectory, while a microparticle does not have a trajectory. 
Hence there is no point in trying to “accuse” the microparticle of having 


been “under the potential barrier” at some moment of time. 





Figure 4.2: A microparticle tunneling 


through a potential well. 
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It has been noted above that the energy of a freely moving microparticle is not quantized. This may be 
easily shown by making use of the tunneling effect. Suppose a microparticle is located in a potential 
well shown in Figure 4.2. On account of the tunneling effect the microparticle may of its own accord 
leave the potential well. Consequently, the time for which it stays in the well is not infinite. If we de- 
note this time as At, it follows from (3.2) that the energy of the microparticle must have an uncertainty 
of the order /At. We reduce the width b of the potential barrier (dotted line in Figure 4.2). It is clear 
that as a result the magnitude of At will decrease, since the probability of the rnicroparticle leaving 
the well will increase. With a decrease in At, the uncertainty in the energy of the microparticle, 4/At, 
will increase. This may be considered as a larger blurring (further broadening) of the energy levels of 
the rnicroparticle in the well. In the limiting case of zero thickness of the barrier, the value of At van- 
ishes, the microparticle becomes a freely moving particle, and the energy levels broaden up indefinitely, 


actually transforming into a continuous energy spectrum. 


5 Impossibility of Classical Representation of a Microparticle 


The process of “breaking up” of objects surrounding us into smaller A Microparticle Is not a Classi- 
and smaller “fractions” leads to microparticles. Therefore it is but natural cal Corpuscle 
to associate microparticles first of all with corpuscles. This is also sup- 
ported by the fact that a microparticle is characterized by a definite rest 
mass and a definite charge. For instance, it is meaningless to speak of a 
half-electron having half the mass and half the electric charge of a whole 
electron. The very terms “microparticle” and “elementary particle” reflect 


the notion of the microparticle as being some particle (corpuscle). 


However, as follows from the preceding discussion, a microparticle is 
considerably different from a classical corpuscle. Firstly it does not have 
a trajectory which is an essential attribute of a classical corpuscle. The 
use of such corpuscular characteristics as coordinate, momentum, angular 
momentum, energy when considering microparticles is restricted to the 
framework of the uncertainty relations. Interconversion of microparticles, 
spontaneous decays, the existence of an indestructible intrinsic moment 
(spin) and the ability to pass through potential barriers indicate that 


microparticles are quite dissimilar to classical corpuscles. 


Wave concepts are radically different from corpuscular concepts. 
Hence it is not surprising that the striking contrast between classical 
corpuscles and microparticle is explained by the existence of wave prop- 
erties in the latter. Moreover, it is the wave properties which account for 
the uncertainty relations and all the consequences resulting from them. In 


this respect, the following remark by de Broglie is worth noting!*: 16 de Broglie, L. (1930). Introduction a 
Etude de la Méchanique Ondulatoire (in 
for a century, the corpuscular method of analysis in optics was too much French) Hermann, Rari 
neglected in comparison with the wave method. Hasn’t the converse been 
the case in the theory of matter? Haven’t we thought too much of the 


“particle” picture and neglected the wave aspect far too much? 
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The question raised by de Broglie is fully justified. However, an excessive 
exaggeration of the wave aspect while considering microparticles should 
be avoided. We must remember that while on one hand a microparticle is 


not a classical corpuscle, it is similarly, on the other hand, not a classical 


wave. 
The analysis of one mistake which is committed quite often even Microparticle Is Not a Classi- 
these days when considering a simplified account of quantum mechan- cal Wave 


ics is quite instructive. We shall demonstrate this mistake through two 


examples. 


It is contended that the wave properties of an electron permit one Example 1 
to derive the conditions for the quantization of momentum which are 
postulated in Bohr’s theory. The “derivation” is done in the following 
way. Let 27r, be the perimeter of the n-th Bohr’s orbit. In this orbit, 


2ah 
an electron moves with de Broglie’s wavelength A, = fa" The basic 


n 
assumption lies in the fact that the perimeter of the orbit should contain 
n wavelengths Àn of the electron. Consequently, 27r, = nàn. This at once 


gives the desired condition for quantization of momentum : 
Pa'n = nh (5.1) 


It is stated that the wave properties of an electron permit a very simple Example 2 
derivation of the formula for the energy levels in a potential well, if we 

assume that a definite number of de Broglie half-waves are confined in the 

potential well (in analogy with the number of half-waves contained in the 

length of a string fixed at both ends) corresponding to different stationary 


states. Designating the width of the one-dimensional potential well by a, 
n 
we write a = Ea from which we get the desired result: 
nr h? 


n= ma (5.2) 





Both these final results [(5.1) as well as (5.2)] are correct; they are the 
same as the result deduced from strict theory. However, the “derivation” 
of these formulas must be considered to be unsound. In both cases in fact 
the same fundamental mistake has been committed: they are based on the 
wrong assumption that the electron in a potential well has a definite de 
Broglie wavelength, or, in other words, a definite momentum. However, 
according to (3.3), the momentum of a microparticle in a bound state is 
characterized by the uncertainty Ap > h/a. Since in the above examples 
p ~ h/A = h/a, it follows that the momentum is of the same order of 
magnitude as the uncertainty in momentum given by relation (3.3). It is 
clear that in such cases one cannot speak about any value of the electron 
momentum (and correspondingly of its de Broglie wavelength) even 


approximately.'” These examples demonstrate an obvious exaggeration 17 This question is considered in greater 
detail in Section 23 of this book. See also 
work by de Broglie. 
de Broglie, L. (1930). Introduction a l’ 
Etude de la Méchanique Ondulatoire (in 
French). Hermann, Paris 
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of the wave aspect. The identification of an electron in a potential well 


with a classical wave inside a “resonator” is incorrect. The picture of an 


electron wave in a “resonator” is the same kind of simplification as the 


picture of an electron-ball moving in a classical orbit. We shall return 


again to the question of waves in quantum mechanics; however, it is 


useful to emphasize at this stage that by the term “de Broglie wave” we 


do not conceal any sort of classical wave. It is just a reflection in our 


imagination of the fact that microparticles possess wave properties. 


If a microparticle is neither a corpuscle nor a wave, then may he it Attempts to Represent a Mi- 
is some kind of a symbiosis of a corpuscle and a wave? Several attempts croparticle as a Symbiosis of a 
were made to model such a symbiosis and thus also to visually demon- Corpuscle and a Wave 


strate the wave-particle duality. One such attempt represents a micropar- 


ticle as a formation, limited in space and in time. This may be the wave 


packet mentioned in Section 3. This may also be just a “scarp” of a wave, 


often called a wave train. Another attempt uses a model of a pilot-wave, 


according to which a microparticle is some sort of a “compound” of a 


corpuscular “core” with a certain wave which controls the motion of the 18 Bohm, D. (1957). Causality and 


core. 


Chance in Modern Physics. Van Nostrand, 
NY 
One of the versions of the pilot wave model is considered by D. Bohm in his book 18, We first postu- 
late that connected with each of the “fundamental” particles of physics (e.g. an electron) is a body existing 
in a small region of space ... in most applications at the atomic level the body can be approximated as 
a mathematical point ... The next step is to assume that associated with this body there is a wave with- 
out which the body is never found. This wave will be assumed to be an oscillation in a new kind of field, 
which is represented mathematically by the y-field of Schrödinger somewhat like the gravitational and the 


electromagnetic, but having some new characteristics of its own... 


We now assume that the -field and the “body” are interconnected in the sense that the /-field exerts a 
new kind of “quantum-mechanical” force on the body, the force is such as to produce a tendency to pull the 


body into regions where || is largest. 


If the above tendency were all that were present, the body would eventually find itself at the place where 
the y-field had the highest intensity. We now further assume that this tendency is resisted by random 
motions undergone by the body, motions which are analogous to the Brownian movement. They could, for 


example, come from random fluctuations in the -field itself. 


Once admitting the existence of these fluctuations, we then see that they will produce a tendency for 
the body to wander in a more or less random way over the whole space accessible to it. But this tendency 
is opposed by the “quantum force” which pulls the body into the places where the i-field is most intense. 
The net result will be to produce a mean distribution in a statistical ensemble of bodies, which favours the 


regions, where the iy-field is most intense... 


Figure 5.1 illustrates the given model as applied to the passage of a microparticle through the 
screen with slits: the y-wave is diffracted on both the slits, while the “body” passes through one slit 


and is registered on the screen in accordance with the result of the interference of y-waves. 
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It is not denied that such models could appear attractive at first glance 
if only because of their intuitive appeal. It must be emphasized at once, 
however, that all these models are baseless. We shall not explain at this 
stage the reasons for worthlessness of the pilot-wave model considered 
above, but shall just mention that it is cumbersome since it uses artificial 
notions such as the w-field which is “to some extent similar to gravita- 
tional end electromagnetic fields”, or the “quantum force” which reflects 
the interaction of certain “object” with the y-field. The reader will later re- 
alize that the worthlessness of such models is not because of some specific 
feature, but because of deep fundamental reasons. He will understand that 
any attempt at a literal interpretation of the wave-particle duality, any 
attempt to model a symbiosis of corpuscle and a wave, should be consid- 


ered fruitless from the very start. A microparticle is not a symbiosis of a 





corpuscle and a wave. 


Figure 5.1: Bohm’s pilot wave model. 


At present the wave-particle duality is considered as the potential How to Understand the Wave- 
ability of a microparticle to exhibit its different properties depending on Particle Duality 
external conditions, in particular, on the conditions of observation. As 


Fock wrote!’: 19 Fock, V. A. (1957). On the interpreta- 
tion of quantum mechanics. Czechoslovak 
Thus under certain conditions an atomic object may exhibit wave prop- a eae 
erties and under other conditions corpuscular properties; conditions are 
also possible when both kinds of properties appear simultaneously but not 
sharply. We can state that it is potentially possible for an atomic object to 
manifest itself either as a wave or as a particle or in an intermediate fash- 
ion, according to the external condition prevailing. It is just this potential 
possibility of exhibiting various properties inherent in an atomic object that 
constitutes the wave-corpuscular duality. Any other, more literal mean- 

ing attached to this duality, such as a wave-particle model of any kind, is 


incorrect. 


Let us consider a simple example. Let a beam of electrons pass through 
a screen with slits and then hit a detector screen. While passing through 
the slits, the electrons realize their wave properties which leads to a distri- 
bution for electrons beyond the slit characteristic of interference. When 
impinging on the detector screen, the electrons exhibit their corpuscular 
properties - each of them is registered at a certain point on the screen. It 
may be said that the electron passes through the slit as a “wave”, and is 
registered on the detector screen as a “particle”. In this connection, it is 
sometimes said that a microparticle is a wave under some circumstances, 
and a microparticle is a particle under other circumstances. Such a treat- 
ment of wave-particle duality is incorrect. Whatever the conditions, a 
microparticle is neither a wave, nor a particle, not even a symbiosis of 


a wave and a particle. It is a quite specific object, capable of exhibiting 
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corpuscular or wave properties to some extent or other depending on 
the circumstances. The understanding of wave-particle duality as the 
potential capability of the microparticle to exhibit different properties in 
different external conditions is the only correct one. Hence, in particular, 
follows an important conclusion: it is impossible to give a definite visual 


model of a microparticle. 


The absence of a visual model of a microparticle does not in any way Electron in an Atom 
prevent us from using tentative models quite suitable for representing a 
microparticle under different conditions. As an example, let us consider 


an electron in an atom. 


We recall that the state of an electron in an atom is described by a set 
of quantum numbers n, l, m, o. A given state is characterized by a definite 
energy, while in the particular case of the hydrogen atom, it depends 
only on the number n [see equation (2.7)], and in a more general case 
on the numbers n and /. An electron in an atom is delocalized in space- 
its coordinates have an uncertainty of the order of the size of the atom. 
Usually, when considering an electron in an atom, it is customary to 
introduce the concept of an electron cloud which may be interpreted in 
this case as a tentative image of the electron. The form and the effective 
size of an electron cloud depend on the quantum numbers n, l, m and, 


consequently, vary from one state of the electron in the atom to another. 


In order to describe the dimensions and the form of the electron cloud, 


we introduce a certain function 


Unim(, 9,6) = oni (r) Zim(4, @) (5.3) 


where r, 0, ọ are the spherical coordinates of the electron. The function 
Unim is interpreted in the following way: unım(r, 0, y) dV is the probabil- 
ity of finding an electron in a state with quantum numbers n, l, m in an 
element of volume dV in the vicinity of point (r, 0, y). In other words, 
Unim(1, 0, @) is the corresponding probability density of finding the elec- 
tron. Remember that dV = r?drdQ, where dQ = sin Odédg is the element 


of solid angle. The function 
wy (r)dr = vni (r) r°dr (5.3a) 


is thus the probability of finding the electron with quantum numbers n, | 


at a distance between r and r + dr from the nucleus. 


In Figure 5.2 (a) are shown forms of functions w,,;(r) for different states 
of the electron in a hydrogen atom. Notice that the functions w10, w21, w32 
have maxima corresponding to the radii of first, second and third orbits in 
Bohr’s theory. Figure 5.2 (b) shows forms of function Zym for some states 


of the electron. For l = 0 (for the so-called s-electron) we have a spherical 
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Figure 5.2: Different states and forms 
of their functions for the electron in a 


electron cloud. For l = 1 (for p-electron) the electron cloud has a form of hydrogen atom. 
either a spindle or a toroid depending on the quantum number m. Thus 
in order to imagine an electron in atom, one may use conventional forms 


like the models of a sphere, spindle, toroid, etc. 


20 Blokhintsev, D. I. (1964). Principles of 
The ground state of the hydrogen atom is characterized by a spherical oe Mechanics. D. Reidel Publishing 
o 


electron cloud. Theory shows (see, for example, Blokhintsev 205 that in 


wn (r) = 4 (3) exp (-=} (5.4) 
r 


1 ri 


this case 


The parameter rı characterizing the effective radius of the cloud is deter- 
mined by relation (4.4); in Bohr’s theory it occurs as the radius of the first 


orbit. 


In conclusion we note that during quantum transitions in an atom, 
there occurs not only a change in energy, but also a “redistribution” of 


electron clouds - a change of their shape and size. 


6 Rejection of Ideas of Classical Physics 


As has been asserted, a transition from macrophenomena to microphe- General Remarks 
nomena presupposes a rejection of the basic ideas of classical physics. The 


notion of a strict continuity in the spectrum of values of physical quanti- 
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ties is no longer valid, the classical concept of a trajectory is rejected, the 
principle of classical determinism is in question. At the root of this view- 
point lie ideas of quantization (discreteness) and wave-particle duality 


which are alien to classical physics. 


On the basis of a number of examples that we have considered, we 
could see the need to reject the classical principle of unlimited detailing 
of objects in space, and of phenomena in time. Thus the question of the 
internal structure of elementary particles turns out to be groundless. 
Likewise the efforts towards a detailed development in time of the process 


of the quantum jump (quantum transition) have no meaning. 


The concepts of energy momentum, angular momentum which are 
widely used in classical physics are carried over to quantum mechanics as 
well. However, these concepts are now seen differently with a reconsider- 
ation of the previous interconnections, taking into account the possibility 
of quantization, and the limitations imposed by the uncertainty relations. 
In particular, there arises the question, unknown to classical physics, of 
the simultaneous measurement of physical quantities, and questions about 
the state and the methods of describing a state are put in a new light. In 
conclusion we stress the impossibility of the classical interpretation of a 


microparticle, and the loss of the clarity of classical physics. 


The rejection of the classical individualization of an object is quite fun- Identity of Microparticles 
damental. In classical mechanics objects are known to have individuality 
since it is always possible in principle to enumerate them and observe 
the behaviour of anyone of them. In this case, however alike two classical 
objects may be, they are never identical and can always be distinguished. 
But in quantum mechanics two microparticles of the same type should be 
treated as absolutely identical. Thus, all electrons are identical and so are 


all unexcited hydrogen atoms, helium nuclei, etc. 


Suppose we have several electrons, one of which is “assigned” the number 1 at the moment of time 
t = 0. Can this electron be identified after a certain time t? Such an identification could have been 


easily done if we could put some “label” on this isolated object. We could get by without “labelling” 
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this electron if we could simply keep a watch over the isolated object, i.e. if we could “mentally” follow 


it (in our imagination) along its trajectory. This is precisely what we would have done in the case 
of isolated classical objects. However, none of this holds in the case of an electron; it is in principle 
impossible to “label” it. Strictly speaking, it has no trajectory. The electron “isolated” by us at the 
instant t = 0, cannot be isolated in actual practice: it does not have the individuality which would 
allow it to be identified in the assembly of electrons after a certain time t: Two electrons are much 
more “like each other” than the proverbial “two peas in a poll”; since the latter are classical objects, 


they could differ in size or in chemical composition in some way. 


It is understood that the identity of microparticles does not exclude the 


possibility of their differentiation on the basis of different states in which 
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these particles may he found. Two electrons belonging to two different 
atoms are, of course, identical but at the same time distinguishable. The 
affiliation of an electron to one atom or another permits its “isolation”. 
However, nothing changes physically if the electrons interchange places. 
It is clear that if such an exchange is possible, for example, by combining 
the atoms under consideration into a molecule, the distinguishability of 


the electrons vanishes. 


Laplacian determinism excludes the element of chance from the be- 
haviour of an isolated object; in classical mechanics, necessity completely 
dominates. Because of this, the laws of classical mechanics are dynamic 
and not statistical. The element of chance (and, consequently, statistical 
laws also) appear in classical physics only when considering aggregates of 


objects or assemblies of particles. 


From this point of view, it is important to stress that in quantum me- 
chanics we are dealing with a qualitatively different situation concerning 
the behaviour of individual microparticles: here, elements of necessity, as 
well as chance, are present. An excited atom spontaneously returns to the 
ground state without any external influence. This is associated with the 
spontaneous transitions of electrons in the atom from one set of energy 
levels to another. It is impossible in principle to indicate precisely when 
a particular excited atom will return to its ground state; such a return is 
a random act. Precisely in the same way, it is impossible to predict ex- 
actly when a given elementary particle, for example, a neutron, will decay 


spontaneously. In this case also an element of chance is present. 


In addition to elements of chance, there are also present the elements 
of necessity in the behaviour of a microparticle. As has already been indi- 
cated in Section 1, if we have No neutrons at the instant t = 0, No > 1, 
we can confidently state that at the time t we will be left with only 
No exp (--) neutrons, T being a constant called the lifetime of the neu- 
tron. Here. the necessity is obvious. In the case of an individual neutron 
this necessity is replaced by a definite probability of keeping the neutron 
intact until time t, once it has managed to survive to time t = 0. This 
probability is equal to exp (--). It should be noted that this probability 
is independent of the time for which the given neutron has survived up 
to the time t = 0. Necessity is also manifested in the conservation laws 
which govern decay processes as well as the processes of interconversion 
of microparticles in general. We may also mention the fact that there are 


definite modes of decay; for example, a free neutron may decay into a 


proton, an electron and an electronic neutrino. 


The existence of chance as well as necessity in the behaviour of an indi- 
vidual microparticle has very important consequences. It leads to the fact 


that quantum mechanics turns out to be in principle a statistical theory 


Chance and Necessity in the 


Behaviour of a Microparticle 
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with probability as one of its basic attributes. As Fock ° has remarked, 21 Fock, V. A. (1957). On the interpreta- 
tion of quantum mechanics. Czechoslovak 


in quantum physics the concept of probability is a primary concept and Journal of Physics, 7:643-656 


plays a fundamental role.” It could be said that the behaviour of an in- 
dividual microparticle is random, but the probability of this behaviour 


is necessary.” The electronic cloud considered in Section 5 may serve 22 Here, it is quite appropriate to recall 
the words of F. Engels: Necessity emerges 


as a good example of this. The occurrence of an electron at some point Fr orrewsitiini the framework of randditness: 


near the nucleus is a random event, but the probability of its being found 
at a given point (r, 0, g) is definite - it is described by a function of the 
type (5.3) or, in other words, is determined by the shape and size of the 


corresponding electron cloud. 


In the end, we note that the element of chance in the behaviour of an 
individual microparticle is due to the uncertainty relations. In Section 4, 
we concluded on the basis of relation (3.3) that it is impossible to “aim a 
microparticle to hit a given point”. In other words, the registration of a 
particular electron at some point of a detector screen is random; we can 
only speak of the probability of such a fact. In Section 3, we introduced 
on the basis of relation (3.2) the concept of the virtual transitions of a mi- 
croparticle. It is easy to see that such transitions also point, towards the 
existence of randomness in the behaviour of a microparticle. When dis- 
cussing the specific nature of the physics of microparticles, it is necessary 
to discuss in greater detail the idea of virtual transitions and of virtual 


microparticles associated with them. 


Perhaps there is nothing more alien to classical physics than the idea Virtual Transitions and Virtual 
of virtual transitions and virtual microparticles. The virtual transition Microparticles 
of an electron from level E; to level E; and back (the transition E) —> 
E, — E2) may be considered as a process in which the electron emits and 
absorbs a photon of energy (E — E1). Such a photon is called virtual. In 
contrast to the photons participating in real transitions, virtual photons 
cannot be observed experimentally. The creation of a virtual photon is not 
connected with an absorption of energy from outside, and its annihilation 
is not connected with a release of energy. The law of conservation of 
energy is not violated since a virtual photon exists for a very short time 
At and, according to uncertainty relation (3.2), the energy of an electron 
emitting the virtual photon is characterized by the uncertainty AE > A 
which may be of the order of, or greater than, the energy of the photon 
(E2 — E1). The emission or absorption by an electron of virtual photons 
corresponds, from a physical point of view, to the process in which an 


electron undergoes virtual transitions. 


Taking into consideration the emission and absorption of virtual pho- 
tons by an electron, one may imagine that each electron is surrounded 
by a photon cloud. This “cloud” should be compared with the electron’s 


own electromagnetic field. Two electrons may exchange virtual photons. 
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In quantum field theory, the interaction of electrons is seen as a result of 
the exchange of virtual photons between electrons. For this we frequently 
make use of Feynman’s diagrams, which enable us to consider the various 


processes of photon exchange. 


Figure 6.1 shows four Feynman’s diagrams demonstrating the scatter- 
ing of one electron by another. The solid lines “show’ electrons and the 
dotted one - photons. The intersections of solid and dotted lines are called 


the vertices of the diagram. 


Let us consider the diagram (a). Here 1 and 2 are the electrons before 
interaction with each other (before scattering), AB is a virtual photon 
which is exchanged by the electrons during the process of interaction 
(note that all the particles indicated in the diagram by lines connecting 
two vertices are virtual); 3 and 4 are the electrons after scattering. Let us 
turn to the diagram (b). Here 1 and 2 are electrons before scattering, AB 
and CD are virtual photons exchanged by the electrons, 3 and 4 are virtual 
electrons, 5 and 6 are electrons after scattering. The diagram (c) is of the 
same type as diagram (b); here the electrons exchange two photons. The 
diagram (d) shows one of the processes in which the electrons exchange 
three photons. It is obvious that there is an infinite number of such dia- 
grams which become more and more complicated (with the participation 


of more and more photons). 


In order to calculate the probability of scattering of an electron by an 
electron, one must consider in principle the contribution of the various 
processes indicated by the various diagrams. Fortunately, the contribution 
of different processes is different: it is less if the number of vertices is 
greater (i.e. if more virtual photons take part in a process). Theory shows 


that this contribution is quantitatively determined by the dimensionless 
2\2 


quantity | — | , where e is the charge of the electron, c is the velocity of 
he 


light, A is Planck’s constant and n is the number of vertices in the diagram. 


Since A = = it follows that the main contribution to the scattering 

of one electron by another must come from the diagram (a) with two 
vertices (exchange of one photon). The four vertices diagrams (b) and 

(c) (exchange of two photons) should provide the next approximation; 
their contribution will be two orders of magnitude lower. Thus, in actual 
practice there is no need to consider a very large number of diagrams, it is 
sufficient to limit ourselves to diagrams with a relatively small number of 


vertices. 


Of course, a systematic study of Feynman’s diagrams and. calculations 
based on them is beyond the scope of this book. These questions are re- 
lated not to quantum mechanics, but to quantum field theory (quantum 


electrodynamics)”*. However, a general introduction to the ideas form- 


sE 


Figure 6.1: Scattering of electrons as 
shown in Feynman diagrams. 


234 simple and detailed account of 


Feynman’s diagrams is given, for example, 


in (Cooper, 1968). 

Cooper, L. N. (1968). An Introduction 
to the Meaning and Structure of Physics. 
Harper and Row, N.Y 
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ing the bases of Feynman’s diagrams is quite appropriate at this point, 

since they permit us to emphasize the specific nature of the physics of mi- 

croparticles and also demonstrate some fundamental quantum-mechanical 

principles (the latter will be considered below, in particular, in Section 2). 
Before ending this discussion on Feynman’s diagrams, we consider the Figure 6.2 Virtualicreatión:of elector: 

so-called effect of polarization of a vacuum. Figure 6.2 shows a diagram positron from a photon. 

describing one of the processes responsible for this effect. A photon is 

transformed into a virtual electron-positron pair, which is annihilated 

and transformed again into a photon (one of the solid lines between 

the vertices of the diagram “shows” a virtual electron, and the other, a 

virtual positron). The members of this pair during their lifetime may ob- 

viously generate virtual photons and, consequently, new virtual electron- 

positron pairs, and so on. As a result of this, the vacuum turns out to be 

not “empty” but “filled” with virtual electric charges which must exercise 

a screening effect on external (real) charges. Experimental confirmation of 


this effect is the best evidence of the usefulness of our concept of virtual 


particles. 
As we have already mentioned, the existence of the element of chance The Microparticle and Its 
in the behaviour of a microparticle is one of its most specific properties. Surroundings 


As a result of this, quantum mechanics turns out in principle to be a 
statistical theory operating with probabilities. But what is the reason for 


the existence of an element of chance in the behaviour of a microparticle? 


This question can be answered as follows: The existence of chance 
in microphenomena is explained by the fact that a microparticle, figura- 
tively speaking, interacts with all its surroundings. The specific nature of 
quantum mechanics is such that, strictly speaking, not a single object in it 
can be considered to be fully isolated, completely independent of its sur- 


roundings. It has been remarked”* that the cause of the statistical nature 24 Myakishev, G. Y. (1973). Dynamic 
and Statistical Laws in Physics (in Russian). 


of quantum mechanics is the same as in classical statistical mechanics, esa 


i,e. the existence of a large number of bonds affecting the motion of the 
object. A particle treated as free in quantum mechanics is in fact free only 
from influences of a dynamic nature. But it remains under the influence 
of random forces which cause quantum fluctuations in its behaviour, as 


reflected by the uncertainty relation. 


What is the nature of the random influences on a microparticle? In 
quantum field theory, it manifests itself in an explicit form as the in- 
teraction of a microparticle with a vacuum (recall that a vacuum is not 
“empty”; it is “filled” with virtual charges). It may be said that a micropar- 


ticle interacts with its surroundings through virtual microparticles. 


The reader should now find it quite natural to interpret wave-particle 


duality as the potential ability of a microparticle to exhibit one kind of 
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property or another, depending on the external conditions, i.e. on the mi- 
croparticle’s surroundings. This envisages a close connection between the 
microparticle and its surroundings - in fact, the very nature of a micropar- 
ticle is displayed in one form or another depending on specific conditions 


and circumstances. 


The impossibility of an unlimited detailization of objects and phenom- 
ena being displayed in quantum mechanics should also be explained by 
the interaction of a microparticle with its surroundings. This means that 
after a certain stage of investigation, physical objects cannot be consid- 
ered as being isolated. Here it is appropriate to recall the statement given 


in Section 3 regarding the discussion on quantum transition: 


During the interaction of an electron with photons there is, strictly speak- 
ing, no electron and no photon but a single entity which must be consid- 


ered as such without going into details. 


Quantum mechanics re-establishes the idea acquired through everyday 
experience regarding the unity of the universe and general connections 
among phenomena. This idea received a setback in the classical theory. 
The sharp boundaries that existed between waves and particles, particles 
and fields, object under investigation and the medium are all obliterated 
and the concept of the interconversion of matter is introduced. We find 
ourselves in full agreement with the following appropriate remark made 
by Bohm(Bohm, 1951): 


It seems necessary to give up the idea that the world can correctly be anal- 
ysed into distinct parts, and to replace it with the assumption that the 
entire universe is basically a single, indivisible unit. Only in the classical 
limit can the description in terms of component parts be correctly applied 
without reservations. Wherever quantum phenomena play a significant 
role, we shall find that the apparent parts can change in a fundamental 
way with the passage of time, because of the underlying indivisible connec- 


tions between them. Thus, we are led to picture the world as an indivisible, 


but flexible and everchanging, unit.2° 25 Bohm, D. (1951). 


Prentice-Hall, N.Y 


Quantum Theory. 
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Interlude: Is a “Physically Intuitive” Model of a Microparticle Pos- 
sible? 


It may well be that these electrons 

Are worlds just like our very own, 

With kings and scholars, arts and armies, 
And memories of ages flown. 

And atoms - cosmic systems, spinning 

Around a central spinning sphere. 

Where things are just like ours, but smaller, 
Or nothing like what we have here. 


- Bryusov (The World of the Electron) 


AUTHOR: The impossibility of the classical interpretation of a mi- 
croparticle predetermines a negative answer to the question “Is it 


possible to have a “physically intuitive” model for a microparticle?” 


CLASSICAL PHYSICIST: It is still not clear why a “physically intu- 
itive” model, of a particle explaining its various properties including 
spin, instability, wave properties, etc. cannot be created. Such a 
model may turn out to be complicated. Or, it may be possible that 
we still do not know enough about a microparticle to create such 
a model. But why can’t we believe in the very possibility of this 


model? 


AUTHOR: There are very sound reasons for this. I shall indicate just 
two of them. Firstly, any modelling envisages in the long run a 
detailization irrespective of whether it is a model of an object or a 
process. However, the impossibility of an unlimited detailization 
is characteristic of microparticles and microphenomena, as we 
have already mentioned. This important situation was persistently 
stressed by Bohr. He wrote, in particular (see his article Quantum 
Physics and Philosophy”): 

A new epoch in physical science was inaugurated, however, by 

Planck’s discovery of the elementary quantum of action, which 

revealed a feature of wholeness inherent in atomic divisibility of 

matter. Indeed, it became clear that the pictorial description of 
classical physical theories represents an idealization valid only 
for phenomena in the analysis of which all actions involved are 


sufficiently large to permit the neglect of the quantum... 


It is appropriate to mention here that this feature of wholeness indi- 
cated by Bohr is closely linked with the identity of a microparticle. 
Secondly, as we have already indicated, a characteristic property of 
microparticles is their inavoidable interaction with surroundings 


leading, in particular, to a dependence of some of the properties of 


Participants: (same as in Pre- 


lude). 


26 Bohr, N. (1963). Essays 1958-1962 
atomic physics and human knowledge. 
terscience, N.Y., Interscience, NY 
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microparticle on definite external circumstances. These properties 
should be treated as certain possibilities which can be realized de- 

pending on the external circumstances. One may ask, in what way 
can these possibilities be reflected in the framework of a definite 


“physically intuitive” model? 


CLASSICAL PHYSICIST: It must be admitted that these ideas serve 
as strong arguments against a “physically intuitive” model of a 
microparticle. However, I don’t like the very spirit of quantum 
mechanics which rejects graphic representations. In my view it in- 
troduces subjectiveness in describing real world. Take, for example, 
the statement: “The electron may present itself as a wave or as a 
particle, depending on circumstances”. Now, everything depends 
on circumstances, especially on the circumstances of observation. 
Involuntarily, one gets the idea that the electron is not something 
objective, but rather something subjective depending on how we 


“look” at it. 


AUTHOR: Of course, this is not true. First of all, you overlook the 
fact that the electron has quite definite characteristics like rest mass, 
electric charge, spin, etc. It is stable and is a fermion. As regards 
a “physically intuitive” model of an electron, well, it simply does 
not exist. However, in rejecting a “physically intuitive” model of a 
microparticle, quantum mechanics in no way sacrifices objectivity 
in favour of subjectivity. It is just that the electron is a very compli- 
cated physical object, and depending on the external circumstances, 
including circumstances of observation, it exhibits its different as- 
pects, which objectively existed in potential form (I stress this) even 
before the observer was born. A sober assessment of this complex 
situation is that a “physically intuitive” model of the electron is not 


possible. 


CLASSICAL PHYSICIST: Can one seriously speak about an object 
without having an idea of what it looks like? Isn’t it strange that we 
study, for example, the behaviour of an electron in a crystal while 


we don’ t even know what an electron actually is? 


AUTHOR: I don’t agree that we don’t even know what an electron 
is. I have just indicated a number of precisely determined charac- 
teristics and properties of an electron. More detailed properties of 
microparticles in general and electrons in particular were consid- 
ered in the preceding sections of the book (and will be considered in 
the following sections). In fact, we know quite a lot about the elec- 
tron and know, in particular, about its behaviour in a crystal. This is 
evidenced by the large number of semiconducting devices fabricated 


and used by us in practice. As you will see, the absence of a “phys- 
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ically intuitive” model of the electron has in no way turned out to 
be a serious obstacle. We may even go a step further and state that 
an understanding of natural phenomena in which Planck’s constant 
plays an important role is possible only through a significant rejec- 
tion of a graphic description. By the way, this idea was given by 
Heisenberg, in whose works much attention was given to questions 


of the use of physically intuitive methods. 


CLASSICAL PHYSICIST: But by rejecting models, isn’t quantum 
mechanics running the risk of losing its material basis? Won’t we be 


finally left with only equations and abstract mathematical symbols? 


AUTHOR: I can understand your doubts. For you, apparently, only 
the extremes matter: either graphic models, or mathematical ab- 
straction. To you, either a model should reflect everything or almost 
everything, otherwise it is quite useless. The doubts arising in your 
mind are a consequence of precisely this type of viewpoint. How- 
ever, the quantum-mechanical approach to such questions is more 


flexible, or rather, dialectical. 


CLASSICAL PHYSICIST: I don’t understand what exactly you mean 
by that. 


AUTHOR: I want to stress two points. Firstly, although there is no 
“physically intuitive” model for a microparticle, this does not stand 


in the way of the “model representations” in quantum mechanics. 
CLASSICAL PHYSICIST: That means I was right after all? 


AUTHOR: This is something different. Quantum mechanics believes 
that even the most refined model cannot reflect the specific charac- 
teristics of a microparticle. Hence quantum mechanics makes use 
of tentative models (tentative images) - sometimes one, sometimes 
another, admitting the relativity of every model. The only thing 
that is important is that each of the models employed should reflect 
some side of the nature of the object or phenomenon. Thus, when 
considering electronic transitions through the forbidden band in a 
semiconductor, we unhesitatingly imagine electrons as some kind 
of corpuscles, which “jump” on the energy scale. When considering 
the propagation of electrons through an ideal crystal lattice we use 
wave concepts. It is convenient to study the scattering of electron 
waves by the elastic waves of a crystal in a “corpuscular language” 
using the picture of collisions of corpuscles of two types — electrons 
and phonons. Similarly, the image of the electron cloud used for 
describing electrons in an atom also serves as a good example of 


tentative modelling. As you see, modelling in quantum mechanics 
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is used quite extensively and flexibly. Moreover, all models are not 


interpreted literally but tentatively. 
CLASSICAL PHYSICIST: All right. And what is your second point? 


AUTHOR: The second point is as follows: Quantum mechanics makes 
use of both tentative models and mathematical abstractions on an 
equal footing. Just on equal footing. At this point modern physics 
breaks off quite radically from classical concepts. Stressing the great 
heuristic (and leading) significance acquired by mathematics in the 
new physics, which was not the case earlier in the epoch of the 


domination of “physically intuitive” concepts, Vavilov?” writes: 27\Vavilov, S. 1. (1970). Lenin and Modern 
Physics (in Russian). Nauka, Moscow 


we don’t have enough accepted ideas and concepts for a physi- 

cally intuitive model interpretation, but logic with its immense 
perspectives represented in mathematical form continues to be valid, 
thereby creating order in the new, unexplored world and opening 


possibilities for physical predictions. 


CLASSICAL PHYSICIST: Asa matter of fact, there is nothing definite 


in all this. 


AUTHOR: To be more precise, there is nothing predetermined before- 
hand. The new physics turns to a study of the objective world, if 
one may say so, “without classical prejudices”. It flexibly makes 
use of different media: models and mathematical abstractions. 
Figuratively speaking, it is not “alien to anything that is human”. 
Summing up, we may say that firstly, when studying microphenom- 
ena, we do make use of visual models quite extensively. Secondly, 
models are by no means taken literally in quantum mechanics; their 
relativeness and arbitrariness are considered. Thirdly, getting ac- 
quainted with microphenomena is based on the dialectic unity of 


model concept, and mathematical abstractions. 


2 Physical Foundations of 


Quantum Mechanics 






POTS 
FON 
res TNN 
eh BIN 


W A 4 
CY 





WA 
Hk A S LEP Slee: z fi ` 


PHYSICAL FOUNDATIONS OF QUANTUM MECHANICS 69 


7 Some Basic Experiments 


The concepts of quantum mechanics are based on a vast collection of 
experimental data gathered over a period of more than fifty years at the 
end of the 19 century and in the first half of the 20 century. Among 
the large number of experiments, a few stand out as being definite “mile- 
stones” and can hence be called decisive. They include the experiments of 
Lummer and Pringsheim on black body radiation coupled with Planck’s 
theoretical works (1900), the experiments of Frank and Hertz (1914) on in- 
elastic collision of electrons with atoms, Millikan’s experiments (1914) on 
the photoelectric effect, confirming the laws predicted earlier by Einstein, 
the experiments conducted by Stern and Gerlach (1921) on the splitting 
of atomic beams in non-uniform magnetic fields, the measurements of 
wavelengths of X-rays scattered by matter carried out by Compton (1923), 
and the experiments of Davisson and Germer, and Tartakovsky (1927) 
on electron diffraction.! These experiments (and many others which did 
not become so famous) constitute the foundation on which, over a num- 
ber of decades, quantum theory was built, perfected, freed from various 


paradoxes, and finally brought to its present harmonious structure. 


Looking now from the position of the existing quantum theory at the 
experimental quest which led to it, it is worth generalizing the actual 
experimental picture by omitting the details that do not play a signifi- 
cant role and trying to conceive the simplest system of basic experiments 
which describe the fundamental aspects of the quantum-mechanical view- 
point. In this section an attempt has been made to consider such a system 
of experiments. This system is built on the basis of actual experiments but 
one should not look for a one-to-one correspondence between the basic 
experiments and actual experiments conducted at a certain time in a cer- 
tain laboratory. Basic experiments must be seen as a sort of generalization 
of several actual experiments. Hence, the experimental details concerning 
a certain apparatus or various details of a historical nature do not play a 


significant role here. 


In our view, resorting to the system of basic experiments is motivated 
by two circumstances. Firstly, being free from the details of actual experi- 
mental researches and their unavoidable “zigzags” and “deadends”, such a 
system of experiments permits one to isolate the main events prominently, 
and clearly show the experimental foundations of the theory. Secondly, 
quantum-mechanical ideas have so radically changed our views on the 
structure and properties of matter that it would not be proper to draw 
final conclusions on the basis of particular individual experiments (even 
on “decisive” experiments) but only on their totality. It is essential to re- 
consider the totality of experiments as a whole, and for this purpose it is 


useful to conceive a system of basic experiments. 


Actual Experiments and the 


System of Basic Experiments 


1A description of these experiments 
may be found, for example in Trigg. 
Trigg, G. L. (1971). Crucial Experiments 
in Modern Physics. Van Nostrand, N.Y 
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Let us begin by considering the well-known experiment on the in- Experiment 1 (Microparticles 
terference of light waves. Figure 7.1 schematically shows the simplest in an Interferometer) 
interferometer. Here, 1 is a point source of monochromatic light, 2 is a 
screen with two small slits A and B, and 3 is a detector screen registering 
the intensity of light impinging it. This intensity indicated on the diagram 
by the curve I(x). The interference character of the curve I(x) is fairly 
simply explained within the framework of classical wave theory of light: 
the light wave from source 1 upon reaching is the screen 2 converts the 
slits A and B into sources of new light waves, which add up to give on 


screen 3 the characteristic interference pattern of intensity distribution. 





We recall that interference of light was observed in the middle of 
the 17" century by Grimaldi, and its explanation on the basis of wave 
concepts was given in the beginning of the 19™ century by Young. Since 


then, the experiment shown in Figure 7.1 is called Young’s experiment. 


CAV OT EN : 
One might ask what relation can the phenomenon of the interference 7 7 AN 
of light, discovered and explained long ago, have with quantum mechan- 
ics? It turns out that the two are directly related. 

1 


Let us gradually reduce the intensity of light from source 1. The il- 
lumination of screen 3 as a result will naturally decrease. However, the 
interference character of the curve I(x) will be retained. By increasing Figure 7:1: The doubierslit experiment: 
the time of exposure, it is possible in principle to obtain the interference 
curve I(x) for even the smallest light intensities. This is not trivial since 
with decreasing intensity of the light beam the number of photons in it 
will decrease and so, obviously, a situation should arise when individual 
photons will have to be considered in place of light waves. However, as 
has been shown experimentally, the nature of the interference curve I(x) 
obviously remains unchanged no matter how much the light intensity 
is decreased. The distribution of the individual photons falling on the 
detector screen gives the same interference pattern on the screen as is 


produced by light waves. 


Moreover, the interference is observed even if at point 1 ( Figure 7.1) 
we place a source of monochromatic electrons (all having the same en- 
ergy). In this case also the intensity of the electron beam may be reduced 
indefinitely. We can even perform an experiment in which the electrons 
pass through the interferometer one by one. By studying the distribu- 
tion of the electrons falling on the detector screen over a sufficiently 
long exposure time we get in this case also the characteristic interference 
pattern [curve I(x)]. Experiments repeated with other microparticles 
(protons, neutrons, etc.) lead to similar results. From an observation of 
the behaviour of microparticles in the interferometer it should be con- 
cluded that, firstly, the phenomenon of interference is inherent in all 


microparticles and, secondly, it should be explained by the properties not 
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of ensembles of microparticles but of individual microparticles. 


We shall try to “follow” the motion of an individual microparticle (say, 
electron) in the interferometer shown in Figure 7.1. The electron emerges 
from point 1, passes through the slits in screen 2 and is finally registered 
at a certain point x on screen 3. By repeating the experiment for a large 


number of single electrons we notice two fairly interesting facts. 


The first fact is the impossibility of predicting precisely at what point 
x a particular electron will be registered. The experimental conditions 
are the same for each electron (remember that the electrons pass through 
the interferometer one by one) and yet each electron “behaves in its own 
way”; moreover, one cannot predict the way in which it will behave. This 
remark applies to every single electron. However, by following a large 
number of electrons, we observe a pattern in the distribution of their 
incidence on the screen 3, shown by a kind of the interference curve 
I(x). Moreover it is immaterial whether we observe the distribution of 
the incidences of a large number of single electrons or the distribution 
of the incidences of electrons from a beam. Thus, the unpredictability 
concerning the behaviour of an individual microparticle is associated 
with the predictability concerning the behaviour of a large number of 


microobjects. 


The second fact is connected with the specific nature of the passage 
of an electron through the slits in the screen. Let us close slit B; in this 
case we observe the distribution of incidences on the screen, as described 
by curve Å (x) (Figure 7.2). Let us open slit B but close slit A; in this case 
the distribution I(x) is observed. By opening both slits, we do not get 
the additive distribution I, (x) + I(x) described by the curve I;(x) in the 





diagram but the interference distribution I(x) which was noted earlier. It 


is this fact that is especially remarkable. If we suppose that each electron 


Figure 7.2: The double slit experiment, 


passes through anyone of the slits, the appearance of the interference with one slit closed. 
distribution I(x) forces us to admit that the electron in some way “per- 
ceives” the other slit, otherwise, it does not matter for an electron passing 
through one slit whether the neighbouring slit is open or closed and thus 
the distribution of incidences with both slits open must be described not 
by the interference curve but by the additive curve B(x) = L(x) + (x). 
The electrons passing through slit A should give the distribution I, (x), 
while those passing through slit B should give the distribution I(x). The 
detector screen should then register the sum of these distributions. Since 
it is meaningless to talk about the electron “perceiving” we have to admit 
that the interference distribution I(x) observed with both the slits open 
is associated with the passage of the electron somehow simultaneously 


through the two slits. 


However, this admission does not simplify matters since it is not clear 
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exactly in what way a single electron can pass simultaneously through 
two slits. Of course, we may assume that the electron at first passes 
through one slit, then it somehow returns through the other slit and again 
passes through the first slit (we get a sort of loop encompassing both the 
slits). Finally, we may assume that near the slits the electron is “spread 
out” (delocalized) in space, partially passing through one slit and partially 
through the other, and while impinging on the detector screen, it is again 
localized in space (we get a temporary splitting of the electron into two 
“halves”). There is no special need to prove that both the attempts made 
above to model the passage of electron through two holes are artificial. 
This becomes quite obvious if we turn to more complicated interferome- 
ters. Thus, in Michelson’s interferometer, for example, one “half” of the 
microparticle will have to move towards one reflecting mirror and the 


other “half”, towards the other (in completely the opposite direction). 


Thus, the interference curve I(x) greatly complicates the problem 
about the nature of the passage of a microparticle through a screen with 
two slits. If the microparticle passes through one slit, then either there 
should be no interference, or we must admit that the microparticle has a 
hidden ability to “perceive” the neighbouring slit. The only logical con- 
clusion arising from the existence of interference is that the microparticle 
passes simultaneously through two slits, though the mechanism of such 
an unusual passage is not clear. In such a situation, obviously, a direct 
experiment could be helpful. Why shouldn’t we try to “spy” on the elec- 
tron to see exactly in what way an electron passes through the slits in the 


screen? Such experiments were actually conducted. Let us see what they 


led to. 

Let us imagine that near the slits A and B of screen 2 we have light Experiment 2 (“Observing” a 
sources 4 and photodetectors 5 (Figure 7.3), designed for “observing” the Microparticle in the Interfer- 
passage of electron through the screen with slits (the photodetectors reg- ometer) 


ister light scattered by the electron). If the electron simultaneously passes 
through both the slits both the photodetectors are activated simultane- 
ously, But if the electron passes through either one of the slits, only one 
detector is activated; in this case we shall also know through which slit 


the given electron passes. 


So, we place an electron source at point 1, switch on the light sources 
4 and watch the photodetectors 5. We shall assume that the electrons 
pass through the apparatus one by one: the source emits an electron only 
after the preceding one has reached the detector screen. What does the 
experiment show? It always turns out that only one photodetector (either 


left or right) is activated and both photodetectors are never activated 





simultaneously. It means that the electron passes not through two slits 


but only through one. Moreover, we can always indicate the slit through 


Figure 7.3: Observing the microparticle in 
a double slit experiment. 
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which any electron passes. 


The reader may surmise that to explain interference we again have 
to start talking about an electron “perceiving” the neighbouring slit by 
some secret means, while passing through a slit. We shall not jump to 
conclusions but shall first carry out the experiment to the end; we obtain 
a sufficiently large number of events of electron incidence on the detector 
screen 3 and see how they are distributed. Here, we get a surprise. On 


screen 3 we get not the interference curve I(x) but the additive curve 
h(x). 


We repeat the experiment after switching off the light sources (thus 
depriving ourselves of the possibility of “observing” the passage of the 
electrons through the slits). In this case we again observe the interference 


curve I(x). 


The situation is such that interference occurs when light sources are 
switched off and is absent when they are switched on. As soon as we try 
to control the process of passage of electrons through two open slits, the 
interference disappears. In other words, our observation of the behaviour 


of electrons near the slits destroys the interference! 


A change in the nature of the motion of electrons upon the switching 
on of the light sources has a simple physical explanation: this change is 
the result of collision of photons with the electrons “being controlled”. In 
the process of measurement some influence on the object under investi- 
gation is inevitable. Here only one thing is important — can this influence 
be made sufficiently small? Maybe the experiment considered above was 
too crude? Maybe we should somehow try to observe the electrons more 


delicately, so that the interference picture is not destroyed by this. 


But how to reduce the influence of a photon on an electron? It is obvi- 
ous that a reduction in the light intensity of the sources would not yield 
anything - light intensity is associated with the number of photons in a 
beam, thus a reduction in the intensity will simply result in an increase in 
the number of “unregistered” electrons. We must reduce the energy of a 
single photon. But for this we shall have to increase the wavelength of the 
radiation and this will lead to an increase in the spatial delocalization of 
the photon (a photon is localized in space with an accuracy not exceeding 
its wavelength) and thus for a wavelength exceeding the distance between 
the slits the photon will no longer be in a position to register a particular 


slit. 


But, may be, we could think of some other experiment - without re- 
sorting to the scattering of photons by electrons? For example, isn’t it 


possible to try to construct an extremely light screen with slits in such a 
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way, that it could move to the left or right upon being struck by individual 
electrons? The screen deflects to the left if an electron passes through the 
left slit and to the right if it passes through the right slit (Figure 7.4). But 
if the electron passes through two slits simultaneously, the screen will 

not move at all. Thus, we just have to follow the movement of the screen. 
It would appear that at least in principle the aim has been achieved - the 
required delicate experiment has been conceived. But it is meaningless 

to set up such an experiment. To make sure that this is so, we recall the 
uncertainty relation for momentum and coordinate. It follows from this 
relation that if the experimental conditions really permit us to register 

the momentum of the screen due to a recoil from the electron impinge- 
ment, the same condition must lead to an uncertainty in the position of 
the screen on the line OO (Figure 7.4). Consequently, the shift in such 

a screen does not permit one to draw any conclusion on the nature of 
passage of an electron through the slits. If, on the other hand, we fix the 
position of the screen on the line OO, then it is easy to see that a measure- 


ment of the momentum of its recoil will become impossible. 


Several attempts were made to devise such an experiment in which 
the passage of electrons through a screen with slits could be “controlled” 
without seriously influencing the electrons themselves (so that the in- 
terference is not destroyed). But all these attempts proved futile. As a 
result, we must admit that the above conclusion regarding the destruction 
of interference caused by observing the behaviour of electrons near the 
slit, is of a fundamental nature. In other words, the effect of observation 


(measurement) destroying interference cannot be eliminated in principle. 


A Brief Interlude 


He was going into one room 


And by mistake entered another ... 


Griboiedoff (Gore ot Ouma) 


READER: This means we were not able to determine exactly how 
an electron passes through a screen with slits-through one slit or 


simultaneously through both the slits? 
AUTHOR: Indeed, we couldn’t. 


READER: But then experiment 2 did not attain its goal. Was it neces- 


sary to consider it? 


AUTHOR: Yes, it was. The experiment did not answer the question 


posed by us. So what? It just means that the question was not 


Figure 7.4: Modifications to the double slit 
experiment. 
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formulated properly. We see that we cannot question all the phe- 


nomena of nature. 


READER: Does the whole idea of the experiment lie in its negative 


result? 


AUTHOR: This is not to be ignored. However, as we shall see later, 
it also contains a positive result of utmost importance. In fact, we 


were looking for one thing, but we found another. 
READER: And what is that? 


AUTHOR: Let us not make haste. We shall first consider our system e 


fundamental experiments to the end. 


Let us pass a beam of light through a polarizer, say, a tourmaline crystal. Experiment 3 (Passage of 

A linearly polarized light beam emerges from the crystal. The direction of Photons Through Polarizers) 
polarization of the beam is determined by the orientation of the polarizer 

with respect to the beam (the direction of polarization coincides with the 

direction of the axis of the polarizer). Let us denote the intensity of the 


linearly polarized light beam through J. 


Further, we place a second polarizer in the path of the linearly polar- 
ized light beam and consider the following three cases: (a) the axis of the 
second polarizer is parallel to the axis of the first; (b) the axis of the sec- 
ond polarizer is perpendicular to the axis of the first; (c) the axis of the 


second polarizer makes an angle æ with the axis of the first. 


We shall measure the intensity of light emerging from the second 
polarizer. In case (a) we get intensity J, in case (b) we do not get anything, 


while in case (c) we get a light beam of intensity I cos? a, polarized along 





where AA and BB are the directions of the axis of the first and the second 


NO 


the axis of the second polarizer. These cases are shown in Figure 7.5, 
(4) a 
A 


polarizers, respectively. 


The above experiment is well known in classical optics. However, like 
the Young experiment on interference, it has a direct relation to quantum 
mechanics. As in the case of interference we shall reduce the intensity 
of the light beam till the photons pass one by one through our set-up. 
We shall consider the cases illustrated in Figure 7.5 as applied to single 


photons. 


First of all, we recall that a photon is characterized by a definite polar- 
ization. Polarization!of a photon Moreover, this polarization corresponds 
to the polarization of the classical light wave from which the photon un- 
der consideration has been “taken”. In particular, this means that after the ( 0) A 
first polarizer we shall have linearly polarized (polarized in the direction 


of the axis of the polarizer) photons. In the following we shall “deal” only Figüre7.5: Three casesof passager 


photons through two polarizers. 
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with these photons and shall call them “initial photons” 


In case (a) the initial photon always passes through the second po- 
larizer; in case; (b), on the contrary, it never passes the second polarizer. 
These results are not unexpected. But what happens in case (c)? It turns 
out that in this case the photon may pass through the second polarizer or 
it may not. Moreover, it is absolutely impossible to predict which of the 
two alternatives (passing or not passing) will be realized for a given initial 
photon. If it so happens that the photon passes through the second polar- 
izer, its polarization will change — it will be polarized in the direction of 
the axis of the second polarizer. Thus, the fate of any particular individual 


photon is, in principle, unpredictable! 


Let us assume further that there are N initial photons. We observe 
their passage through the second polarizer in the case (c) and see what 
happens. We find that if N is sufficiently large, the number of photons 
passed can be predicted fairly accurately; it will be about N cos’ a. In this 
connection, we recall our earlier remark that the unpredictability in the 
behaviour of an individual microparticle is related to the predictability in 
the behaviour of a large number of microparticles (see Experiment 1). We 
can say that there is a definite probability of the initial photon passing 
through the second polarizer. This probability is equal to cos” a. 


Let us now complicate the experiment. We use the situation shown 
in Figure 7.5 (c) and add yet another (i.e. a third) polarizer, whose axis 
is perpendicular to the axis of the first polarizer. The three-polarizer 
system under consideration is shown in Figure 7.6. Let N be the number 
of initial photons (i.e. photons passing through the first polarizer). After 
the second polarizer, as we already know, we shall have N cos? a photons, 
the polarization of these photons coinciding with the axis of the second 
polarizer. Analysing further in the same way we conclude that after the 
third polarizer, we must have N cos? a sin? a photons; moreover, the 
polarization of these photons must coincide with the axis of the third 


polarizer. The experiment certainly confirms this conclusion. 


There is nothing that appears astonishing in this (that is, of course, if 
we assume that our astonishment over the existence of two unpredictable 
possible behaviours of an individual photon has somewhat diminished). 
And yet there is something here which contradicts our usual concepts. Let 
us remove the second polarizer. Then no photons will be observed after 
the third polarizer. This creates a fairly interesting situation. Photons 
pass through this apparatus, as if they are “filtered” first through the sec- 
ond polarizer and then through the third. As a result, we at first have N 


photons, then N cos? @ and we are finally left with N cos? a sin? æ photons. 


We remove one of the “filters” and thus, it would appear, improve the 


conditions for passage of photons through the given apparatus. However, 


Weos“a sina 


Neos 


Figure 7.6: Passage of photons through 
three polarizers. 
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in actual practice it turns out quite differently - now the photons do not 


pass through the apparatus at all! 


We shall consider elastic collisions of microparticles and use for con- 
venience the centre of mass system for the colliding particles. Figure 7.7 
shows experimental diagram related to the system of the centre of mass 
of the particles. Here, A and B are particle beams, 1 and 2 are the counters 
for scattered particles, deployed on the line perpendicular to the direc- 
tion of motion of the particles before collision. Thus, we consider here 
the scattering of particles through an angle of 90° in the centre of mass 


system. 


We note that the picture of the process in the centre of mass system 
may considerably differ from the analogous picture in the laboratory 
system. Thus, for example, in the laboratory system the counters 1 and 2 
may not be on the same line. Besides, in actual practice only one beam of 
particles (for example, particles of type A) may be used while the particles 
of the other type (type B) constitute the stationary target. It is assumed 
that every time the experiment in the laboratory system is conducted 
in such a way that the diagram shown in Figure 7.7 is applicable for the 


centre of mass system of the particles. 


We shall consider different examples as applied to the above diagram, 
measuring each time the probability of scattering of particles by the 


number of simultaneous activations of counters 1 and 2. 


First Example. Particles of type A are a-particles (*He nuclei), parti- 
cles of type B are *He nuclei; counter 1 registers only @-particles, 
counter 2, only *He nuclei. Let w be the probability of scattering 


measured in this case. 


Second example. The particles are the same, but now each counter can 
register both @-particles and *He nuclei. In this case the measured 
probability of scattering turns out to be 2w. This result appears 
quite natural - the doubling of the probability w is associated with 


the realization of the two alternatives shown in Figure 7.8. 


Third example. We replace *He nuclei by a-particles. Let the a-particles 
be scattered now by a-particles. It would appear that in this case 
the scattering probability must be the same (or nearly the same) 
as in the previous case, i.e. 2w. The experiment, however, yields 
quite a different result, 4w. A “mere” replacement of *He nuclei by 
“He nuclei has doubled the scattering probability! Still more unex- 
pected results are observed by taking into account the spin states 
of the colliding particles (in the case of a-particles the question of 
taking spin into account did not arise since a-particles do not have 


spin). In this connection let us consider scattering of electrons by 


Experiment 4 (Scattering of Mi- 


croparticles by Microparticles) 
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Figure 7.7: Elastic collision of microparti- 
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electrons. We recall that an electron may exist in two spin states 
(o = 1/2, —1/2). Electrons created as a result of photoelectric emis- 
sion, for example, appear in one spin state or the other with same 
probability. Such electron beams are termed nonpolarized; half the 
electrons in them have ø = 1/2 and the other half, o = —1/2 . If we 
take special measures, we may obtain a polarized electron beam in 
which all electrons are in the same spin state. Having made these 
remarks, let us now return to the diagram in Figure 7.7 and con- 
tinue the list of examples under consideration. Moreover, we shall 
assume that the energies of the colliding particles are considerably 
small, hence, the possibility of an electron changing its spin state 


upon collision need not be taken into consideration. 


Fourth example. The two electron beams are nonpolarized. Let the 


scattering probability measured in this case be we. 


Fifth example. The electron beams are polarized but in both directions. 
For example, A-electrons have spin ø = 1/2 and B-electrons, o = 


—1/2. In this case the scattering probability turns out to be 2we. 


Sixth example. The electron beams are polarized in the same direction. 
In this case the counters 1 and 2 are “silent” — the scattering proba- 


bility is zero! 


As will be seen later, the results of experiments on the scattering of mi- 


croparticles reveal fundamental quantum-mechanical laws. 


We have thus considered a system of four fairly simple experiments. 
While considering them we emphasized the unexpectedness of the results, 
which indicates the impossibility of their classical explanation. The sys- 
tem of basic experiments could have been enlarged and supplemented by 
more complex experiments. However, we shall not do this. We content 
ourselves with considering the four experiments, as we think that all the 


basic principles of quantum mechanics are fairly clearly revealed in them. 


On the basis of the experiments considered, we move on to build up in 
the following sections a system of quantum-mechanical concepts which 


essentially expresses the physical foundations of quantum mechanics. 


8 Amplitudes of Transition Probabilities (Formulation of Basic 


Principles) 


We suppose that for a certain microparticle definite initial and final 
states (s-state and f-state, respectively) are considered. The specific char- 
acteristics of these states as well as the nature of the microparticle are 


immaterial for the present. As has been pointed out earlier, the transition 


Conclusion 


Introduction to the Concept 
of Amplitude of Transition 
Probability 
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of the microparticle between the two given states has, as a rule, a proba- 
bilistic character. We therefore introduce into the picture the transition 
probability w,_,. In quantum mechanics, apart from transition probabil- 


ity, the concept of the amplitude of the transition probability (f | s} ? is 2 The treatment of quantum mechanics 
on the basis of probability amplitudes is 
given in books by Feynman and Dirac. 


whose modulus is equal to the transition probability: Please note that this is not a square of 
(f |s) but the footnote number. 


also introduced. Generally speaking, it is a complex number, the square of 


(8.1) Feynman, R. P., Leighton, R. B., and 
Sands, M. (1965). Lectures on Physics, 
volume 3. Addison-Wesley Reading, 

Note that the amplitude of the transition probability is written so that the Mass; Feynman, R. P. and Hibbs, A. 

(1965). Quantum mechanics and path 


. . . integrals. McGraw-Hill, N.Y; and Dirac, P. 
from right to left). Henceforth for brevity we shall call the amplitude of A. M. (1958). The Principles of Quantum 


woo =I FIs) 


initial state is on the right and the final one on the left (as if it were read 


the transition probability the transition amplitude (and sometimes even Mechanics. Clarendon Press Oxford 


more briefly simply the amplitude). 


Besides introducing the concept of the transition amplitude, which is of 
utmost importance to quantum mechanics, we shall devote this section to 
a formulation of a number of basic principles in their most general form. 
The reader should not be perplexed over the formality of the exposition 
in this section. It will be compensated for in the next section where we 
shall demonstrate the principles indicated in this section using specific 
examples. Moreover, among other things, we shall consider the connec- 
tion between these principles and the experiments discussed in Section 7. 
This will provide, on one hand, grounds for the formal description of 
the principles and, on the other hand, an explanation of the astonishing 


experimental results. 


We shall indicate four basic rules of working with transition ampli- Basic Rules of Working with 
tudes. These rules should be considered as postulates forming the basis of Amplitudes 
a system of quantum-mechanical concept which are in conformity with 


experiment. 


First Rule. We assume (Figure 8.1 (a)) that there are several physically 
indistinguishable ways (paths) in which a microparticle can move 
from s-state to f-state. In this case, the resulting transition ampli- 
tude is the sum of the amplitudes corresponding to the different 


modes of transition: 
Tiss) FE (8.2) 
(the index i denotes the i-th mode of transition). 


Second Rule. We assume (Figure 8.1 (b))that there are several final states 
(fi f2,.--, fi- --) and that we are considering the probability of 
transition to any of these states, no matter which state it is. In this 


case, the resulting transition probability | (f | s} |? is the sum of the 
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transition probabilities to the various final states: 


IFIS =J IFI) (8.3) 


Third Rule. Let us assume (Figure 8.1 (c)) that the transition s — f takes 
place through some intermediate state (v-state). In this case we 
introduce the idea of the amplitude of the successive transitions 
s — vandv — f (corresponding to the amplitudes (v | s} and 


(f |v); the resulting amplitude is the product of these amplitudes: 


(fs) = (fe) ls) (8.4) 


In other words, if the transition is broken up into successive steps, 
the transition amplitude is expressed by the product of the ampli- 
tudes of the separate steps. It should be noted that if the notation 
in (8.4) is read from right to left, then the states will be designated 
in proper sequence: first the initial state, followed by the intermedi- 


ate state and then the final state. 


Fourth Rule. Suppose (Figure 8.1 (d)) have two independent microparti- 
cles. Suppose one microparticle undergoes a transition s — f and 
the other simultaneously undergoes a transition S — F. In this 
case the resulting transition amplitude for the system of micropar- 
ticles is given by the product of the transition amplitudes for the 


individual microparticles: 
(fF | sS) = (fs) (F |S) (8.5) 


We can see that the second, third and fourth rules appear quite natural 
since, together with (8.1), they represent well-known theorems, i.e. the 
theorem of addition of probabilities (second rule) and the theorem of 
multiplication of probabilities of independent events (third and fourth 
rules). Only the first rule, which may be called the rule of addition of 
amplitudes, appears unusual. In a certain sense, the entire system of 
quantum-mechanical concepts is based on the rule of the addition of 


amplitudes. 


Suppose that the transition of a microparticle from the initial to the 
final state (s — f transition) always takes place through one of the inter- 
mediate states (v1, v2,...v;...) (Figure 8.1 (e)) this case, one or the other 
mode of transition s — f (one alternative or the other) is determined by 
the “participation” of the corresponding intermediate state in the transi- 


tion. 


We take two different cases. Suppose in the first case the intermediate 
state through which a given transition takes place is known. This is the 


case of physically distinguishable alternatives. To describe this we must 


(a) first wile 


(0) seeoud we 


(C) thd tule 
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Figure 8.1: Rules of working with ampli- 
tudes. 


Distinguishable and Indis- 
tinguishable Alternatives. 


Interference of Amplitudes 
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combine the second and the third rules. The transition probability that we 


obtain as a result will be of the form 
ICFs)? = X IG lo) oils) l (8.6) 


One might ask: where does the second rule come in if it only involves 
the various final states? As a matter of fact, if we know the intermediate 
state at which the microparticle arrives, it may be treated as the final 
state of the first step of the transition. We fix the microparticle in this 
state and temporarily stop the experiment here. One microparticle will be 
fixed in one state, the others in various other states, so, a situation with 
different final states actually arises. We stress here that distinguishability 
of alternatives is connected with the actual existence of different final 
states (even if in the given experiment they play the role of intermediate 


states). 


In the second case, we don’t know the intermediate state through 
which a particular transition takes place. This is the case of physically 
indistinguishable alternatives. To describe this case we should combine 
the first and the third rules. The resulting transition probability will be of 


the form 


(fis) = A (f loi) (oils) (8.7) 


The result (8.7) is a specifically quantum-mechanical one. When it holds, 
we speak of the interference of amplitudes of transitions through different 
intermediate states. We emphasize that the interference of amplitudes 

is possible only under conditions of the physical indistinguishability of 
the alternatives corresponding to the given experiment (the microparti- 
cle is not fixed in the intermediate state, hence we are actually dealing 
only with one final state). Changing over from amplitude in (8.7) to the 
probability of transition, we get 


IFIS P= D7 1 GF Lan) (orl s) P (8.8) 


The difference between the cases of physically distinguishable and indis- 
tinguishable alternatives becomes clear from a comparison of probabilities 
(8.6) and (8.8). While in the first case the probabilities of the alternatives 
are added, in the second case it is the amplitudes of the probabilities of 


alternatives which are added. 


It is interesting to expand somewhat the fourth rule regarding the Transition Involving Two Mi- 


simultaneous transition of two microparticles. Let us suppose (Fig- croparticles 
ure 8.2 (a)) that one microparticle undergoes a transition s — f through 

the intermediate v-state and that another microparticle simultaneously 

undergoes the transition S — F through the intermediate V-state. By 


combining the third and fourth rules we represent the transition ampli- 
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tude for the system of microparticles in the following form: 


(fF | sS) = (f 10) (| s) FLV) (VS) (8.9) 


Let us assume further (Figure 8.2 (b)) that both the microparticles in 
the process of their transitions ass through one and the same intermediate 


v,-state. Then (8.9) must take the form 


(fF | sS) = (f | v1) (1 |s) (F | 01) (a1 |S) (8.10) 


Finally, we assume (Figure 8.2 (c)) that each microparticle realizes a num- 
ber of physically indistinguishable alternatives through different interme- 
diate states (v1, v2, . . ., vi). Moreover, every intermediate state is common 
to both the microparticles. In this case generalizing result (8.10) by com- 


bining it with the first rule, we get 


(fF | sS) = >; (F | 01) (vi | s) (F | vi) (ai | S) (8.11) 





The interference of amplitudes is destroyed when the alternatives become 

oa: . : . N Figure 8.2: Rules of working with ampli- 
distinguishable. We shall show how this happens by using transitions tudes; 
involving two microparticles [by using the result (8.11)]. Destruction of the Amplitude 


Interference 
For brevity, we shall call the microparticle undergoing thes => f 


transition the s-particle and the one performing S — F transition as 

the S-particle. We assume that S-particles are used to “control” (observe) 
through which intermediate state the transition of the s-particle takes 
place in each specific case. To exercise this control, we must have as many 
final F-states (denoted by F}, F>, . . ., F; ) as there are intermediate v-states 
to be “controlled” and arrange it so that an S-particle passing, for example, 
through the v;-state, goes just to the Fj-state. This is schematically shown 
in Figure 8.2 (d). Here each intermediate state appears to be “bound” 

to a definite final F-state. We simply have to watch the final state in 
which a “controlled” S-particle appears in each case. Whereas earlier 

(in the absence of S-particles) the alternatives were indistinguishable, 

so that it was not known in each case through which intermediate state 
the transition of the s-particle took place, the different alternatives now 
become physically distinguishable. An S-particle observed in any F-state 
unambiguously indicates the intermediate state through which a given 
transition took place. According to the remarks made above, the use of S- 
particles for distinguishing intermediate states must lead to a destruction 


of the interference of amplitudes. Let us verify this now. 


If only the S-particle which has passed through the intermediate v;- 


state comes to the F;-state, then we have as a result (F; | vk} = 0 for k + i. 
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Using (8.11) we get from this 


(fF; | sS) = (f | vi) (vi | s) (Fi | vi) (vi | S) (8.12) 


Since the F;-states are the various final states, we get, according to second 
rule, the following expression for the resulting transition probability of 


the “controlled” s-particle: 


(Fs) P= XIF: sS) 2 
: (8.13) 
= $1 loi) (oil s) (F: lo) (oi 18) F 


If we further assume that the amplitude (F; | S} is the same for all i 
(which is often the case in practice), then, denoting this amplitude by a 


for brevity, we rewrite (8.13) in the form 
(Fis) P= lal? >" (fle) (ils) ? (8.14) 


Thus while result (8.8) is obtained in the absence of S-particles (in the 
absence of “control”), we now have the result (8.14). It is easy to see that 
it corresponds to (8.6) - we destroy the interference of amplitudes by 
establishing a “control” over the intermediate states, i.e. by turning the 


physically indistinguishable alternatives into distinguishable ones. 


9 Amplitudes of Transition Probabilities (Demonstration of Ba- 


sic Principles) 


Using the concept of the transition amplitude and the rules relating Behaviour of a Microparticle 
to the amplitudes, let us turn to Experiment 1 discussed in Section 7. An in the Interferometer and 
electron emerges from the initial s-state, passes through a screen with Interference of Amplitudes 


slits A and B, each of which corresponds to its intermediate state (A-state 
and B-state, respectively) and is finally registered in its final x-state, i.e. 
at the point with coordinate x on the detector screen. Suppose that slit 

A is open and slit B is closed. In this case (x |s) 4 = (x |A} (A |s}. The 
probability of transition s — x, i.e. the probability of the electron being 


registered at point x of the detector screen, is of the form 


Ilsa =| A) (Als) I? (9.1) 


We denote this probability by I; (x), and recall that this is how we denoted 
the distribution of electron incidences on the detector screen in Exper- 
iment 1 (Section 7) under the conditions that slit A is open and slit B is 
closed. For the probability of an electron being registered at point x in the 
case when slit B is open and slit A is closed, we may write the analogous 


expression 
| (xls) |? = | (x |B) (BI s) |? = h(x) (9.2) 
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Now let us open both the slits. Since it is impossible to indicate through 
which slit any electron passes (the alternatives are indistinguishable), we 


have, consequently, 
(x | s) = (x | A) (A |s) + (x | B) <B |s} (9.3) 


Denoting the transition probability | (x | s) |? by I(x) (we denoted the 
interference curve observed on the detector screen with both slits open in 
the same way in Section 7), and taking into account (9.1) and (9.2) we get 
from this 

I(x) = | (x| A) + (A| s) + (x | B) (Bs) |? (6.4) 

= I(x) + h(x) + (x|s)4 («| s)p + (x1 5) 4 (1 5)B 

It is easy to see that the resulting probability of transition is not equal to 
the sum of probabilities of the transitions through slits A and B [I(x) + 
T(x) + b(x)]. 


In addition to the components I; (x) and I(x), the right-hand side of 
expression (9.4) contains two additional terms caused by the interference 
of amplitudes. It is these terms that account for the difference of the 


interference curve I(x) from the additive curve B(x) = h(x) + R(x). 


Thus, the interference distribution of the electron incidences on the 
detector screen observed with both the slits open in Experiment 1 in 
Section 7 is a consequence of the result (9.4), i.e. a consequence of the 
interference of the amplitudes of two possible transitions of the electron 


from the given initial state to the given final state. 


Figure 9.1 depicts schematically the fundamental Experiment 2 consid- Destruction of Interference of 
ered in Section 7. Here s is the electron source, S is the photon source, F; Amplitudes upon “Controlling” 
and F, are photoelectric counters which fix the two final states of photons the Behaviour of a Microparti- 
scattered by electrons in the vicinity of slits A and B. ele ie Aten acouieies 


To begin with, we shall assume that the photons scattered in the vicin- 
ity of either of the slits may be registered in the F,-state as well as the 
F,-state (which corresponds to the use of radiation with a sufficiently 
large wavelength). In this case, obviously, the photons don’t “control” 
the passage of electrons through the screen with slits. We denote the 


transition amplitudes thus: 
for electrons 


(x |A) (Als) = 1 
(x |B) (B |s) = 2 


(9.5) 


for photons (taking into account the symmetry of the photon transition 





Figure 9.1: Destruction of amplitudes. 
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which can be clearly seen from Figure 9.1) 


(Fi | A) (A| S) = (Fz | B) (BIS) = yı 
(F: | A) (A|S) = (Fi | B) (BIS) = y2 


(9.6) 


Using these notations and result (8.11), we write the following ex- 
pression for the probability amplitude of simultaneously registering an 


electron at point x and a photon in the F;-state 


(xF: | sS) = gil + P22 (9.7) 


Correspondingly, for the probability amplitude of simultaneously reg- 


istering the electron at point x and the photon in the F,-state, we write 


(xF: | sS) = gio + G21 (9.8) 


The probability of an electron being registered at point x, independent of 
where the photon is registered, is of the form (according to second rule 


from Section 8) 
| (x |8) |? = | (xFy | sS) |? + | (xF? | sS) |? (9.9) 
Substituting (9.7) and (9.8) into this, we find 


læs) = (art +12?) da? + | ye I?) 


(9.10) 
+ (9102 + 9 G2) (Waa + Yiya) 


Thus the resulting probability of the electronic transitions — x is 

made up of two components. The first is the sum of the probabilities of 
transitions through slits A and B (considered separately) multiplied by 

(| v4 |? + | y2 |?). The second component has an interference character; it 
is due to the interference of amplitudes. Because of the existence of this 
component, we observe the interference distribution of the impingements 


of electrons on the detector screen. 


Thus, when the photons don’t “control” the passage of electrons 
through a screen with slits, we observe the interference effect described 


by the expression (9.10). 


Remember that in the example considered here we assumed a suffi- 
ciently long wavelength for the radiation. Let us now reduce the wave- 
length. This will lead to a reduction in the probability of a photon scat- 
tered by an electron falling in the “alien” counter (for example, the proba- 
bility of a photon scattered near slit A being detected by counter Fz). This 
means that with a decrease in the radiation wavelength, the amplitude 
W2 must decrease. A decrease in the amplitude y, will also decrease the 


relative contribution of the interference component as is seen clearly 
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from (9.10). As a result, the interference pattern observed on the detector 
screen gets blurred. For a sufficiently small radiation wavelength, it is 
possible to accurately “control” the passage of electrons through a screen 
with slits. In this extreme case a photon scattered near either of the slits 
arrives only at its “own” detector. This means that y) = 0. Substituting 


this result in (9.10) we get 


Ils) P = ya PC ar? + | ge |?) (9.11) 


Thus, “control” of the passage of electrons through a screen with slits 
leads to a destruction of the interference amplitudes, and consequently to 
a disappearance of the interference distribution of the electrons Impinge- 
ments on the detector screen. The result (9.11) is in complete accord with 
(8.14). “Control” makes the alternatives corresponding to the passage of 


an electron through different slits distinguishable. 


From this example we see that there is a subtle point involved in the 
question of the distinguishability of alternatives: in addition to the com- 
plete indistinguishability and complete distinguishability, there is a con- 
tinuous spectrum of intermediate situations which should be identified 
with partial distinguishability. The result (9.11) describes the limiting case 
of complete indistinguishability of the alternatives under consideration 
(y2 = 0). The opposite extreme case of the complete indistinguishability 
of alternatives envisages equal probabilities for a photon falling on its 
“own” or the “other” detector: yı = y2. In this case it is easy to see that 


expression (9.10) assumes the form 
| (xs)? = 21y Pei + ezl? (9.12) 


Results (9.11) (the squares of the moduli of electron amplitudes are added) 
and (9.12) (the electron amplitudes themselves are added up) are obtained 
from (9.10) as particular (limiting) cases. The general expression (9.10) 
describes the intermediate situation corresponding to partial distinguisha- 
bility of the alternatives under consideration, differing from one another 
by the magnitude of the interference component. The less the interference 


component is the greater is the degree of distinguishability of alternatives. 


Thus, distinguishability and indistinguishability are by no means 
discrete. Complete indistinguishability is continuously transformed into 
complete distinguishability through intermediate situations corresponding 
to partial distinguishability. In Section 10 we shall return to the question 
of partial distinguishability from the point of view of the principle of 


superposition.” ,* 


We turn to Experiment 4 of Section 7. Let sı and sz be the initial states 
of the colliding microparticles and f; and fz be the final states registered 


by the corresponding counters. In Section 7 we considered the scattering 


3 Partial distinguishability of alterna- 
tives is described in detail in Helfer et 
al. 4 Helfer, Y. M., Lyuboshitz, V. L., and 
Podgoretsky, M. I. (1975). Gibbs’ Paradox 
and Identity in Quantum Mechanics (in 
Russian). Nauka, Moscow 


Scattering of Microparticles 


and Interference of Amplitudes 
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through an angle of 90° in the centre of mass system of the colliding par- 
ticles. For a more general approach, we shall consider scattering through 
an angle 0. In this case the counters are arranged along a straight line 

at an angle @ with the initial direction of the colliding particles - see Fig- 
ure 9.2 (a) (the analysis is carried out, as before, in the centre of mass 


system of particles). 


If upon scattering one microparticle undergoes a transition sı — fı and 
the other a transition s2 — fz (Figure 9.2 (b), the scattering amplitude has 


the form 


9 (9) = (fil si) (fa | s2) (9.13) 


Another alternative is also possible: one microparticle undergoes a tran- 
sition sı — fz and the other, ss — fi (Figure 9.2 (c)). In this case the 


scattering amplitude has the form 


o (x = 8) = (fal si) (fi | s2) (9.14) 


Suppose that the microparticles are completely distinguishable. This 
may mean, for example, that different kinds of microparticles, or the same 
kind of microparticles with different spin states are colliding. We first 
consider a situation when counter fı registers microparticles from sı only 
and counter f) from sz only. In this case the probability of a simultaneous 


activation of both counters is 


le (DP = | fil si) (fl s2) l? (9.15) 


Second situation: each counter registers any of the microparticles partic- 
ipating in the collision (the situation corresponds to the second example 
in Experiment 4 of Section 7). We should now consider two possible alter- 
natives. Since these alternatives are also completely distinguishable for 
completely distinguishable microparticles, the probability of simultaneous 


activation of both counters will be given in this case by the expression 


lg)? +1 e(a- 9)? SIAI Rls) P+ lD (filse) |? (016) 


For 0 = 1/2, we get the probability 2| ¢ (7/2) |? - it is this doubling of 
probability which we mentioned in the second example in Experiment 4 


of Section 7. 


Further, we assume that the microparticles are completely indistinguish- 
able. This means that microparticles of the same type and in the same 
identical state are considered. Note that the identity of microparticles 
mentioned in Section 6 is a necessary condition for complete indistin- 


guishability. 


If the microparticles are completely indistinguishable, so are the al- 


Figure 9.2: Three examples of colliding 
microparticles. 
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ternatives shown in Figure 9.2 (b), (c). In this case we should sum not 
the probabilities of the alternatives, but their amplitudes. The probability 
of simultaneous activation of the counters should be determined by the 
expression 


w=|9(0)|+9(1-6)|? (9.17) 


When applied to the third example in experiment 4 of Section 7 (when we 
considered the scattering of a-particles by a-particles), the result assumes 


the form 


w =| (2/2) +9 (2/2) |? 


9.18 
= 4| (2/2) pey 


It is this four-fold increase in the probability | g (7/2) |? which was ob- 


served in the experiment. 


Interference of scattering amplitudes is just one of the consequences of 
the complete indistinguishability of microparticles. Another consequence 
is that the probability of the simultaneous activation of counters should 
not change if we interchange s; and s2, or, in other words, if we inter- 
change the scattering amplitudes ọ (0) and (a — 0). If we proceed from 


these consequences, the probability may be formally written in the form 
w=|9 (0) + 9(1-6)|’ (9.19) 


The alternative with the “+” sign (interfering amplitudes have the same 
sign) is already familiar to us - it is the expression (9.17). The other alter- 
native, when the amplitude with opposite signs interfere, is also formally 
possible. It is remarkable that nature “employs” this alternative as well; 
this can be verified by studying the results of experiments on scattering of 


electrons by electrons. 


Thus, we assume that amplitudes with opposite signs interfere: 
we =| (8) — (1-8)? (9.20) 


and turn to the results of the indicated experiments. For 0 = 2/2, the 
probability (9.20) vanishes. This corresponds to the sixth example in 
Experiment 4 of Section 7. We recall that this example concerned the 
collision of electrons in the same spin state. It is in this case that we have 


two completely indistinguishable alternatives for the electrons.° >We shall henceforth omit the word 

“completely”, but shall always mean it 

while speaking of distinguishable and 

indistinguishable alternatives. Partial 

in Experiment 4), the alternatives are distinguishable. In this case the distinguishability will be specially men- 
tioned. 


If the colliding electrons are in different spin states (the fifth example 


probability of the activation of the counters is given (as for distinguish- 
able particles) by expression (9.17), which for 0 = 2/2 leads to the familiar 
result 2 | ọ (1/2) |? In the case of non-polarizedelectron beams (the fourth 


example in Experiment 4), it should be remembered that the probability 
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1 
of collision between two electrons in similar spin states is ~. From this, 
taking (9.20) and (9.16) into account, we get the following expression for 


the probability of activation of the counters 


we = =| (9) - 9 (4-8) [? 


2 (9.21) 
[lo (8) |? +| 9 (2-9) |" 


+ 


NIle 


Result (9.21) includes the summing of amplitudes (for cases characterized 
by indistinguishable alternatives) as well as the summing of probabilities 
(for cases characterized by distinguishable alternatives). For 0 = 1/2, 
(9.21) gives the probability we = | g (1/2) |?.This is half the “classical prob- 
ability” (i.e. the probability taking place in the case of indistinguishable 
alternatives) in complete agreement with the results of the experiments 


considered in Section 7. 


Thus, we have found that the experiments on the scattering of mi- Interference of Amplitudes and 
croparticles described in Section 7 provide a good experimental back- Division of Microparticles into 
ground for the concept of the interference of amplitudes. Moreover, these Bosons and Fermions 


experiments indicate the necessity for using not one but two different 
laws of interference, (9.17) and (9.20). We shall discuss the meaning of 
these two laws, assuming that 0 = 2/2. According to (9.17), we have for 
a-particles 


w = 4| (2/2) |? (9.22) 


and from (9.20) we have for the electrons in the same spin state 
we =0 (9.23) 


The use of angle 0 = 2/2 makes the scattering diagram symmetrical. 

If in addition we also take into account that the electrons are in similar 
spin states (a-particles do not have spin), we may conclude that expres- 
sions (9.22) and (9.23) describe the probabilities of a-particle pairs and 
electron pairs, respectively, appearing in the same state. Comparing this 
expression with the “classical probability” 2 | ø (27/2) |”, we can come to 
the conclusion that one kind of microparticles (a-particles in this case) 
exhibits a tendency to “populate” a given state densely, while other mi- 
croparticles (electrons in this case), on the contrary, may “populate” states 


only one at a time. 


The fact that all microparticles in nature are divided, according to their 
behaviour in assemblies of similar particles, into two groups — bosons 
(with a tendency to densely “populate” the same state) and fermions (“pop- 
ulating” the states only one at a time) has already been mentioned in 
Section 1. Now we see that this fundamental fact is associated with the 
existence of two different laws for the interference of amplitudes. In the 


case of bosons, the amplitudes with like signs interfere; in the case of 
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fermions, it is the amplitudes with opposite signs that interfere. 


Let us consider an example: there are three atoms emitting photons 
(s1, $2, $3) and three photon counters (fi, f2, f3). The amplitude of prob- 
ability that three transitions sı — fi, s2 — fo, s3 — fs take place 
simultaneously is (fi | s1) (f2 | s2) (fs | 53). We assume that the photons are 
registered in the same state. We then have 3! = 6 indistinguishable alter- 
natives. Besides the one indicated above, the remaining five amplitudes 


are given by 


(fi | 82) (fa | s1) B | s3), 
(fi ls) (fs | 82) fa | 83) 
(fs | s1) (fa | s2) (fil 83) 
(fi | 82) (fa | 83) B | S1) 5 
(fi lss) (fa | s1) (fs | s2) 











Assuming that the amplitude of each transition s; — fi, is the same (we 
denote it by a), and taking into account the indistinguishability of alterna- 
tives, we get, for the amplitude of a transition with the emission of three 
photons, the expression 3! a, and for the probability of the transition 
(3!)?| a? |?. If the alternatives were distinguishable (if distinguishable mi- 
croparticles were emitted), the probability would be expressed by 3!| a? |’. 
Generalizing these results for the case of n microparticles, we get for emis- 
n |2 and n!| a” |? 


sion probabilities the expressions (n!)?| a , respectively. 


Let w, be the probability of the emission of n bosons (photons in this 
case) in the same state, and W, the probability of the emission of n distin- 


guishable microparticles in the same state. It is easy to see that 
Wn = n!W,, (9.24) 


Consequently, the probability of the combined detection of n bosons 
is n! times greater than the probability of the combined detection of n 
distinguishable microparticles. We rewrite the result (9.24), replacing n by 
n+1: 

Wn+1 = (n+ 1)! Wn+1 (9.25) 


Dividing (9.25) by (9.24) we get 


w W, 
n+1 = (n+ 1) n+1 


n n 








(9.26) 


This means that the probability of getting one more boson in a state 
where there are already n bosons is (n + 1) times greater than the proba- 
bility of getting one more distinguishable microparticle in a state where 
there are already n such microparticles. We note further that for distin- 
guishable microparticles the degree of prior “population” of the state is 


not important. When applied to bosons, it is analogous to the situation 


Bosonic Nature of Photons 
and Processes of Spontaneous 


and Induced Emission of Light 
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when a boson appears in a state which was not occupied earlier. Hence 
result (9.26) may also be interpreted in a different way. The probability 
of getting another boson in a state having n bosons is n + 1 times greater 
than the probability of a boson appearing in a state which was hitherto 
unoccupied. In accordance with this interpretation we can rewrite result 


(9.26) in the form 
| (n+ 1|n) |? = (n+ 1)| (1/0) |? (9.27) 


Here we consider a particular state, and (1 | 0) is the amplitude of the 
transition: 

(unoccupied state) — (state with one boson), (n + 1 | n} is the ampli- 
tude of the transition: 


(state with n bosons) — (state with n + 1 bosons). 
When applied to amplitudes and not probabilities, result (9.27) means 
(n+1|n) = Vn+1(1|0) (9.28) 


Let us analyse the result (9.27) by considering photon emission (emission 
of light). It is easy to see that the probability of emission is split into two 


components: 


| (n+ 1|s) [2 =| (10) |? (9.29) 
| (n+1|n) [7 =n] (1/0) |? (9.30) 


The component (9.29) describes the probability of spontaneous emission 
of light and the component (9.30) that of induced emission of light. In 

the case of spontaneous emission, the transitions in a substance (to be 
more precise, in the radiating atoms) are spontaneous. They are mutu- 
ally unrelated and are independent of external radiation. In contrast to 
spontaneous emission, induced emission depends on the existence of pho- 
tons near the radiating atom — the more photons there are, the greater is 
the probability of induced emission. It turns out that on account of their 
bosonic nature, the photons “extract” as it were, new photons from the 
substance. To be more precise, they stimulate transitions in a substance 
which lead to the emission of new photons. We stress that a “stimulated” 


photon is created in the same state in which the “stimulating” photon 
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was. 

Experiment shows that the probability of the absorption of light by a Absorption of Light and Con- 
substance depends upon the number of photons in the radiation. In this nection Between the Ampli- 
respect the process of absorption of light is an induced process. By using tudes of Direct and Reverse 
the analogy with (9.30) we can write the following expression for the Trandate 


probability of the annihilation of a photon in a state having n photons: 


| (n—1|n) |? =n]|(011) l. 
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We rewrite this result, replacing n by (n + 1): 
| (n|n+1) |? = (m+ 1)| (011)? (9.31) 


Comparing (9.31) with (9.27), we conclude that the probabilities of direct 


and reverse transitions are equal to each other: 
| (n+ 1|n) |? =| (n[n+1)|’. 


Thus, using a specific example, we have demonstrated the so-called 


principle of microscopic reversibility: 


IFIS)? = IsI fy? (9.32) 


When applied to transition amplitudes, this principle means 


Fls) = 6f (9.33) 


Result (9.33) supplements the set of four rules applicable to transition 
amplitudes, which we have considered in Section 8. It may be treated as 
an additional, fifth rule. 


We shall consider two useful examples which demonstrate the am- Supplementary Examples 
plitude concepts very clearly. Scattering of neutrons by a crystal. We 
shall consider the scattering of very slow neutrons (with energies of the 
order of 0.1 eV and lower) by atomic nuclei. It is well known that for such 
low energies, the scattering amplitude 9, considered in the centre of mass 
system for a neutron and a nucleus, is independent of the scattering angle. 
So, it would appear, the scattering probability should also be isotropic. 
However, experiments on the scattering of very slow neutrons by crys- 
tals reveal a strong angular dependence of the scattering probability. A 
typical curve is shown in Figure 9.3: peaks are observed against a smooth 
background. These peaks are a visual demonstration of the effect of the 


interference of amplitudes. We shall see how this is so. 


Suppose a neutron emerges from its initial s-state and is registered in 
its final f-state. The intermediate i-state corresponds to the i-th nucleus rn acai 


in the scattering crystal lattice. According to the third rule of Section 8, 


(fis); = (fli) @ (ils) (9.34) 


We assume that all the N nuclei of the crystal are alike, have zero spin, Figure 9.3: Results of scattering of neu- 


trons by a crystal. 
and are located strictly at the lattice points. 


In this case it is impossible in principle to indicate precisely which nu- 
cleus causes the scattering of the neutron — the alternatives corresponding 


to the scattering by differently “numbered” nuclei are indistinguishable. 
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Hence, we must take into account the interference of the amplitudes: 


N 
(fls)= X Fl) 


i (9.35) 
= DF 1 eCils) 


Taking (9.35) into account, the probability of scattering of the neutron has 


the form A 


N 
IFIS P =le PA ID ls) (9.36) 


On account of the addition of the amplitudes of the scattering at the 
nuclei, which are distributed in space in a definite order, a mutual “can- 
celling” of amplitudes is possible in some directions and a mutual “en- 
hancement” in some others; this interference effect is manifested in the 


form of sharp peaks for definite scattering angles. 


We further assume that nuclei in the crystal have a spin (let it be equal 
to 1/2, as for neutrons). In this case we should distinguish between the 
amplitude of scattering by the nucleus with spin inversion (in accordance 
with the law of conservation of momentum, the neutron spin must also he 
inverted in this case), and without spin inversion of the nucleus (and the 
neutron), i.e. between y and ọ, respectively. If the scattering of the collid- 
ing microparticles is accompanied by spin inversion, the corresponding 
alternative is distinguishable - it is clear that in such an act of scattering 
only that nucleus participates whose spin has been inverted. Now the 


probabilities and not the amplitudes should be summed. 


Taking account of the amplitudes ọ and y we can represent the scatter- 


ing probability of neutrons from s-state to f-state in the following form: 


2 2 


IFIP =le] +I xl (9.37) 





N N 
DF 1H (ils) Df li (ils) 


The first term on the right-hand side of (9.37) accounts for the character- 











istic interference peaks in Figure 9.3, while the second term is responsible 
for the smooth background. It is customary to say that the first term de- 
scribes the probability of the coherent scattering, and the second that of 


the incoherent scattering of neutrons. 


Atomic beam in inhomogeneous fields. Let us assume that a beam 
of atoms occupying a certain initial s-state (this is a definite spin state) 
passes first through one region of non-uniform magnetic field (B1), then 
through another (B2) and is finally registered in its final spin state f. Non- 
uniform magnetic fields are used here as factors capable of changing the 
spin state of the atoms in the beam. Experiment shows that the probabil- 


ity of the s — f transition is different in the cases when 
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(a) observations are carried out to find the state of the atoms in the beam 


between fields B; and By , and when 


(b) such observations are not carried out. 


The difference in probabilities | (f | s) |* in these two cases is easily ex- 





plained if we take into account that in case (a) the intermediate i-spin 
states are fixed each time and, consequently, the alternatives correspond- 


ing to them are distinguishable; hence: 
ICFs) P= DOLLA Gls)? (9.38) 


In case (b) the intermediate i-states are not fixed and, consequently, the (i $) 


alternatives corresponding to them are indistinguishable. Hence 


2 Figure 9.4: Path integral approach. 


ifls)? = DF 1) ils) (9.39) 


FEYNMAN’S PATH INTEGRALS. In conclusion indicate the possibility of a somewhat unusual 
formulation of quantum mechanics, based on amplitude concepts. In the beginning of this section, we 
considered an interferometer in the form of a screen with two slits. There were two indistinguishable 
alternatives corresponding to the two interference amplitudes. Let us now assume that instead of a 
screen with two slits, we have a screen with n slits. The number of alternatives (and, correspondingly, 
amplitudes) will be equal to n. Further, we place another screen with n slits parallel to the first screen; 
the number of alternatives (amplitudes) will rise to n?. We shall continue this process by placing more 
and more screens in the space between the source of microparticles and the detector screen; moreover, 
we Shall simultaneously keep on increasing the number of slits in each screen. In the limiting case of 
an infinitely large number of screens and an infinitely large number of slits, we arrive at a situation 
when the entire space between the source and detector screen will turn out to be “filled” by different 
‘paths’ of the microparticle, each corresponding to a definite alternative and a corresponding definite 
amplitude (one such path is indicated in Figure 9.4 (a). The total amplitude of s — x transition is the 
sum (more exactly, the integral) over all the possible amplitudes. Finally, we assume that the system 
of screens with slits was introduced only fictitiously; in fact there are no screens and the entire space 
between points s and x is “filled” by all the possible paths. If we can write the transition amplitude for 
each such trajectory, then the probability of the s — x transition may be found as the square of the 
modulus of the integral summing up all the given amplitudes (called the path integrals). In this sense, 
the quantum-mechanical motion from s — x is nothing but the superposition of a set of classical 
motions (classical trajectories) almost in the same way as indicated in Figure 9.4 (b). The transition 
from quantum to classical mechanics corresponds to the reduction of the given superposition of paths 


to a certain individual trajectory. 


The concept of the motion of a microparticle along classical path integrals (in other words, through 


interference of amplitudes corresponding to the classical trajectories) is discussed in detail in Section 5. 
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10 Superposition of States 


In the previous section, while studying the problem of the probability 
of the transition of a microparticle from one state to another, we intro- 
duced and discussed the concepts of the transition probability amplitude, 
distinguishable and indistinguishable alternatives of transition, the inter- 
ference of amplitudes corresponding to indistinguishable alternatives, all 
of which are specifically quantum-mechanical concepts. on the basis of a 
number of examples the reader should convince himself of the importance 
of the concept of interfering amplitudes, which gives an explanation of 


the results of various experiments with microparticles. 


The interference of transition amplitudes is inseparably linked with 
one of the most fundamental principles of quantum mechanics — the 
principle of superposition of states, which reflects the specific nature of 
the “interrelations” among the states of a microparticle. We shall now 


consider this principle. 


Earlier, in Section 3, we studied the uncertainty relations. In this con- Principle of Superposition of 
nection it was remarked, in particular, that the states of a microparticle States 
were combined in groups each of which contained the definite values of 
anyone complete set of physical quantities. We also gave examples of 


complete sets of values for an electron and a photon. 


Continuing the discussion commenced in Section 3, we shall introduce 
notations for the various complete sets: the a-set, the f-set, the y-set, 
etc. In this context we shall speak of the group of a-states, the group of 


f-states, etc. 


We suppose that the microparticle is in one of the a-states. It means 
that the quantities in the a-set have definite values. But what can we say 
about the values of the quantities in any other complete set, for example, 
the /-set? According to the uncertainty relations, the quantities in the 
f-set do not have definite values in the state under consideration. The 
reader is justified in taking this as a negative fact. But, fortunately, this 
is fully “compensated” by a positive circumstance - the principle of the 


superposition of states. 


According to the principle of superposition, there exists a link between 
the states of the microparticle, corresponding to different complete sets: 
any state from one set may be represented in the form of a superposition 
of states from another set. Thus, for example, a given a-state may be 
represented in the form of a superposition of f-states. If we arbitrarily 


adopt the symbol ( | to indicate the state of a microparticle, the principle 
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of superposition may be written in the form 
(al = X Pop (BI (10.1) 
B 


The expression (10.1) appears as an “expansion” of the given state (a | 

into the sum of f-states, the numbers ©, playing the role of coefficients 
in the expansion. More concretely, the number ®, is the amplitude of 
the probability that we shall obtain values corresponding to the state (£ | 
while measuring the quantities from the f-set in the state (a |. In other 
words, it is the amplitude of the probability that a microparticle in state 
(æ |, may also be found in the state (£ |. If we denote this amplitude by the 


usual symbol (a | p}, expression (10.1) assumes the following form: 


(al =D) alp (BI (10.2) 
B 


In the preceding sections we considered the amplitudes of the transition 
probability (in short, transition amplitudes). At first glance, the principle 
of superposition “brings into play” a new type of amplitudes of probability. 
Actually, the above-mentioned expression “(a | f} is the amplitude of the 
probability that a microparticle in state (a | may also be found in state 
(p| allows another obvious interpretation: “(a | p} is the amplitude of the 
probability of a microparticle arriving in state (£ |, if it is known that the 
given particle actually exists in state (a |”. In short, the latter statement 
means that (a | p} is the amplitude of the probability with which any £- 
state is “represented” in a given a-state. This produces the impression 
that the amplitudes (a | f} don’t have anything to do with any transitions 
or processes. Considering this impression, we introduce a new term for 

(a | P) - the amplitude of probability of state (in brief, the amplitude of 
state). 


The reader must be warned at once that the above impression is erro- 
neous. However, in order to be convinced of this, we must analyse the 
process of measurement. Measurement in quantum mechanics will be 
discussed in Section 11. This discussion will necessitate some correction 
in the above definition of the amplitudes (a | $} and will enable us to actu- 
ally reduce the amplitudes of states to the already familiar amplitudes of 


transitions. 


But everything has its own time, and so, for the time being, we shall 
operate with the concept of “amplitude of state” as an independent con- 
cept, without ascertaining the practical meaning of, say, the phrase “a 


particle in state (a | can also be found in state (£ |”. 


As has been noted earlier, the principle of superposition “supplements”, 
as it were, the uncertainty relations; its positive content “compensates” 


the well-known negative aspect of these relations. Figuratively speak- 
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ing, the uncertainty relations indicate the “old” concept which must be 
rejected while we go over from macrophenomena to microphenomena. In 
particular, they require a rejection of the simultaneous measurability of 
all physical quantities characterizing a given object. At the same time, the 
principle of superposition indicates the “new” concept which is applica- 
ble when considering microparticles; superposition (10.2) means that if a 
microparticle is in a state in which the quantities of the a-set are measur- 
able, then the value of the quantities in the f-set may be predicted with a 
probability equal to | (a | B) |?. 


In classical physics one comes across superposition quite frequently, Superposition in Classical 
the foremost example being the well-known superposition of classical Physics and Quantum Me- 
waves. From a mathematical point of view, the classical superposition and chanics 


superposition in quantum mechanics are analogous. This circumstance 
greatly stimulated the development of quantum theory. At the same 
time, it certainly complicated the interpretation of the physical content 


of theoretically obtained results since it tempted one to draw erroneous 


analogies with classical waves. In the words of Dirac’: ®Dirac, P. A. M. (1958). The Principles 
of Quantum Mechanics. | Clarendon Press 
Oxford 


the assumption of superposition relationships between the states leads to a 
mathematical theory in which the equations that define a state are linear 
in the unknowns. In consequence of this, people have tried to establish 
analogies with systems in classical mechanics, such as vibrating strings or 
membranes, which are governed by linear equations and for which, there- 
fore, a superposition principle holds ...(remember the criticism in Section 5 
of the attempts to represent the motion of a bound microparticle with the 
help of classical waves in a resonator - author’s remarks). It is important 
to remember, however, that the superposition that occurs in quantum 
mechanics is of an essentially different nature from any occurring in the 
classical theory, as is shown by the fact that the quantum superposition 


principle demands indeterminacy in the results of observations. 


Speaking of the difference between quantum and classical superposi- 
tions, remember that a superposition of two classical waves leads to the 
generation of a new wave having, of course, new characteristics. How- 
ever, a superposition of two states (f,| and (/2|, characterized by the 
values fı and f; respectively, of quantities in the f-set, by no means leads 
to a state having any new value of f. As an example, let us consider a 


certain superposition of states 


(a | Bi) (Bil + Ca | Bo) (Bol 


We shall measure the /-quantity in this state. As a result, we get every 
time one of the earlier values, fı or f2. Moreover, it is impossible to accu- 
rately predict which of the two states will be obtained in any particular 
measurement. We can only indicate the probability of getting fı or p2 
These probabilities are equal to | (æ | £1) |? and | la | f2) |*, respectively. It 
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is this specific uncertainty in the results of measurements that determines 
the fundamental difference between quantum and classical superposi- 


tions. 


Taking into consideration the quantum-mechanical principle of super- 
position, let us return to the question of quantum transitions discussed in 
Section 2. Suppose we consider, as before, two energy levels E; and Ez of 
the microparticle. We denote the corresponding states of the particle in 
which it has energy E; or E; (i.e. is on the first or second level) through 
(1| and (2 | respectively. According to the principle of superposition, in 


addition to states (1| and (2| we can also get the state 


(fl = CF 11) Gl + FI 2) 2l (10.3) 


Measurement of energy of the microparticle in this state leads either to 
the result E; or to the result E; (as if the microparticle were on the level 
E; and simultaneously on the level E2). The first result is obtained with 
the probability | (f | 1) |, and the second, with the probability | (f | 2) |? 
The possibility, of the existence of such a specific quantum-mechanical 
situation immediately removes the basic contradiction of the quantum 
transition, mentioned in Section 2. It is sufficient to assume that an inter- 
action of a microparticle with radiation leads to a superposition state of 
the type (10.3). Then the probability of finding a particle on one energy 
level or another (on its earlier level or on a new one) will be described 


simply by the square of the modulus of the corresponding amplitude of 


state. 

In order to finally convince ourselves that the quantum-mechanical Mutually Orthogonal States: 
principle of superposition has in fact nothing in common with the clas- Total and Partial Distinguisha- 
sical superposition we turn to expression (10.2) and see how it changes bility of States 


upon the transition to classical physics. Since in classical physics all the 
quantities can be simultaneously measured, they form together one “com- 
plete set”. Considering that the superposition bonds described by relation 
(10.2) operate between different complete sets, we arrive at the conclusion 
that in the classical case such bonds simply do not exist and, consequently, 
all formally composed amplitudes of states must be taken as being equal 


to zero: 
(a |B) =0 (10.4) 


In quantum mechanics condition (10.4) also exists, but only “within the 
limits” of the given complete set (for states belonging to the same set). 
Thus, for example, 


(a; |a;) =0, if 14 j (10.5) 


The amplitude of a state is equal to zero if and only if the two correspond- 
ing states are mutually independent (if an object is in one of these states, it 


cannot be found in the other). Such states are called mutually orthogonal. 
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In this respect all the states of a classical particle are mutually orthogo- 
nal, while in quantum mechanics only the states belonging to the same 
complete set are orthogonal and the states belonging to different sets are 
non-orthogonal. This last fact is reflected in the principle of superposition 


of states. 


The idea of mutually orthogonal states permits us to define more 
precisely the concept of complete and partial distinguishability. For this 
purpose it is convenient to make use of the above-mentioned example 
of the scattering of microparticles of one type by one another. Since 
such particles are identical, their distinguishability is determined by the 
distinguishability of states (states (sı | and (sz |; see Section 9). We recall 
that in Section 9 we discussed, in particular, microparticles of the same 
kind but in different spin states, as examples of completely distinguishable 
microparticles. We thus tacitly assumed that different spin states are 
completely distinguishable. We can now clearly indicate the criterion of 
complete and partial distinguishability of states: if (sı | s2) = 0, the states 
(sı | and (s2 | are completely distinguishable; there are no superposition 
bonds between them. But if (s1 | s2} # 0, the states under consideration 
are partially distinguishable. In other words, orthogonality of states is the 


criterion of their complete distinguishability. 


Suppose we are considering scattering of two bosons of the same type. 
The result (9.16) will hold good if the initial states are mutually orthogo- 
nal (let these states be (a, | and (az |. The result (9.17) will be valid if the 
initial states are similar (say, (a1 | and (a |. Both these results are just two 
extreme cases corresponding to complete distinguishability and complete 
indistinguishability of microparticles, respectively. However, different 
intermediate situations corresponding to partial distinguishability are 
possible, when the initial states of microparticles are a superposition of 


several mutually orthogonal states: 


{s1| = (sı | 1) (æı| + (s1 | æ2) (a2 


(10.6) 
{S2| = {s2 | @1) (æ | + {s2 | @2) (aa 
It can be shown” that in this case the probability of simultaneous activa- 7 Helfer, Y. M., Lyuboshitz, V. L., and 
: A : y Podgoretsky, M. I. (1975). Gibbs’ Paradox 
tion of detectors is determined by the expression znd Identity-in Quantum Mechanics (A 
Russian). Nauka, Moscow 
2 2 
w=] (8) +| ọ(x-9)| 

(10.7) 


+ | (si | s2) |? Eo (0) g” (x — 8) + 9° (0) p (4 - 0)] 


For | (s; | s2) |? — 0, the result (10.7) turns into (9.16) (we come to the 
limiting case of complete distinguishability). For | (sı | s2} |? — 1, the 
result (10.7) turns into (9.17) (we come to the limiting case of complete 


indistinguishability). 
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Thus, we find that the question of the complete and partial distin- 
guishability of alternatives in quantum mechanics is closely linked with 
the quantum-mechanical principle of superposition, more precisely, with 


the mutual orthogonality or non-orthogonality of states. 


The different states corresponding to the same complete set of quan- 
tities are called basic states (eigenstates). The amplitudes of elementary 


states satisfy the condition 
(a; | a;) = Oijs (10.8) 


where 6;; is the so-called Kronecker delta symbol; it is equal to zero for 

i + j and unity for i = j. The expression (10.8) is called the condition of or- 
thonormalization of basic states. It is obtained by taking into consideration 
(10.5) and the fact that the probability of getting the value a; in the state 
(æi | is obviously equal to unity. The important property of the system of 
basic states is its completeness: any state may be expanded in a system of 


elementary states. 


Basic states may be selected in different ways depending on the com- 
plete set under consideration. Thus, different systems of basic states may 
be used: {(a; |}, {(f; |} etc. It is said that different representations are pos- 
sible. For a more general approach, the principle of superposition of states 
signifies the fact that any state (f | of a microparticle may he expanded in 


any system of basic states: 


(FL= 0 Cla) (al 
i (10.9) 


(FL =D) FI Bd Al 


The principle of superposition permits an explanation of the results of 
experiment 3 in Section 7. Using this principle, we shall consider the 
passage of individual photons through the system of three polarizers 
shown in Figure 7.6. We denote the state of polarization of a photon after 
the first polarization by (s|. According to the principle of superposition, 
the state (s| may be considered as a superposition of the basic states (1| 
and(2|, corresponding to two independent polarizations of the photon- 


along and perpendicular to the axis of the second polarizer, respectively: 


(s| = (s| 1) l| + (s12) (2 | (10.10) 


(note that in this example, the system of basic states contains only two 
states). The amplitudes of the states may be written in this case in the 


form (s| 1) = cos q and (s | 2} = sina. Thus, 


(s| = cosa (1|+sina (2 | (10.11) 


Basic States 


Superposition of States and 
Passage of Photons Through 


Polarizers 
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The second polarizer lets through photons from the state (1 | only. Since 
according to (10.11) the state (1 | is “represented” in state (s | with a prob- 
ability cos? a, out of N photons only N cos? a: photons will pass through 
the second polarizer. Moreover, all the photons that pass will appear in 
the state (1 | (i.e. they will be polarized long the axis of the second po- 
larizer). Thus, in front of the second polarizer, the photon exists as if 
partially in the state (1 | and partially in the state (2|. At the instant when 
the photon passes through the polarizer, this “duality” vanishes. In some 
cases a photon exists in the state (2 | and so cannot pass through the po- 
larizer, while in some other cases it is in the state (1 | and can thus pass 
through the polarizer. Further, for any individual photon it is impossible 
to predict in which state it will appear (hence it is impossible to predict 


whether a given photon will pass through the polarizer or not). 


While studying the passage of photons through the third polarizer, we 
proceed in a similar way. The state (1 | is expanded into a system of basic 
states (1’| and (2’| corresponding to a polarization of photon along or 


perpendicular to the axis of the third polarizer: 
|= sina (1'| + cos a (2’| (10.12) 


The third polarizer lets through photons in the state (1’| only. This state 
is “represented” in the state (1’| with a probability sin? æ. Hence out of 

N cos’ a, photons only N cos? æ sin” æ photons will pass through the third 
polarizer, all these photons being in the state (1’|. If we now remove the 


second polarizer, then in place of (10.11) and (10.12) we get 
(s| = (s| 1) (1| + <s | 2”) (2” (10.13) 


where, as can be easily seen, (s | 1’) = 0 and <s | 2’) = 1, so that (s | = (2’|. 
Naturally, in this case, at the outlet of the system of polarizers no photons 


are observed at all. 
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Let the transition from state (s | to state (f | take place through certain Principle of Superposition 
intermediate v-states. We suppose that the microparticle is not fixed of States and Interference of 
in the intermediate state so that a case of physically indistinguishable Transition Amplitudes 


alternative takes place. In this case, as we know, the transition amplitude 


(f |s} given by the expression 
Fs) =D) (Ff lon (oil) (10.14) 


where (f | vi} and (v; | s} are the amplitudes of the respective transitions. 
The intermediate v-states must be completely distinguishable, since oth- 
erwise there is no point in introducing the concept of distinguishable or 
indistinguishable alternatives, since the very concept of alternatives loses 


its meaning. Consequently, the v-states must form a system of mutually 
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orthogonal basic states. Taking this into consideration, we make use of 


the principle of superposition and express the state (f | in the form 


l= 2, To (0: | (10.15) 


where the amplitudes (f | 0;) are essentially the amplitudes of the states 
(the bar is used to distinguish the amplitude of the state from the transi- 
tion amplitude (f | v;)). If superposition (10.15) exists, there is actually no 
need to perform a transition from (s | to (f |. Since the state (f | is a su- 
perposition of states (v; |, it is sufficient to simply accomplish transitions 
from the state <s | to each of the states (v; |: This means that the transition 


amplitude (f | s} is a superposition of the transition amplitudes (v; | s}: 
(fs) = >) FT ei) tils) (10.16) 
Comparing (10.16) and (10.14) we conclude that 


(f loi) = (fF | oi) (10.17) 


This means that at the level of mathematical apparatus, we have accom- 
plished a reduction of the amplitudes of the states to transition of ampli- 
tudes. In other words, the amplitudes of the states in fact play the same 
role in the apparatus of quantum mechanics as the transition amplitudes. 
Incidentally, we have also established that relations (10.14) and (10.15) are 
closely related to each other and hence the effect of the interference of 
transition amplitudes and the principle of the superposition of states are 


directly interrelated. 


In conclusion, we mention an important method which is widely ap- 
plied in quantum mechanics. It is easy to see that if we delete the sign 
of the initial and final states in the expression for the interference of am- 
plitudes, we automatically obtain the superposition expression for the 
undeleted state. Thus, if in (10.14) we delete | s) on the left- and right- 
hand sides of the equation, we obtain the superposition expression (10.11) 
for the state (f | (note that in addition to the symbol ( |, the symbol | Y is 


used for denoting a state). 


We shall demonstrate the basic methods reflecting, in Feynman’s 
words, the “mechanics of quantum mechanics”. Let us delete the state (f | 


from the left- and right- hand sides of equation (10.14). This gives 


(s| =)! (1 = @ils) (10.18) 
i 
Let us further assume that some apparatus converts state (s | into some 


other state (s’|. We express this in the general form as 


Als) =|s’) (10.19) 


The Mechanics of Quantum 


Mechanics 
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We shall say that the operator A has acted on the state | s ) giving rise to 
state | s’). We shall apply the operator A to both sides of equation (10.18). 
Using (10.19), we get 


Is’) = $ Alon (ols) (10.20) 


Further, in place of the state (f | deleted earlier, we restore the state (v; | 


belonging to the system of basic states: 
(ols) = XA (0; [A] oi) (oils) (10.21) 


(the notation (vj |A | vi) should be taken as (vj | vt), where |v; } = A|v;}. 
Finally, we rewrite (10.14) replacing |s Y by | s’): 


FIs = >) (Fle) ails’) (10.22) 


J 


Substituting (10.21) into (10.22) we get 
(FIAIs) = >> >" (lo) (0; [A] ex) (oils) (10.23) 
i j 


Summarizing the methods demonstrated above, we write the equations 
obtained so far and leave it to the reader to follow the logic and appreciate 


the beauty of the transformations: 


s) = (f 1s) 
s= J, (fla) (orl) 


s) = Dail (als) 
A|s)= J,A (oil (oils) 





(0; |A]s) = D1 (Aloi) il) 
DF les) (or lAls) = X, Dd, Fes) (oA) Corl 


j 


(FIAS = >) >) Clo) (0; | A] ox) ls) 
Yi i 


Finally, we assume that operator A acts on the state | s}, and is followed 
by operator B. If the reader has mastered the logic of the “mechanics 
of quantum mechanics” (it would have been more accurate to call it the 


“algebra of quantum mechanics”), he will at once surmise that 


(f|BA|s) =)" >, >, fox) (ox |B]o;) (ol loi) oils) (10.24) 
k j i 
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(the notation (f | BA | s} should be taken as (f | s’’), where | s”) = B|s’) 
and |s’) = A|s)). 


This formal structure of the mathematical “manipulations” is character- 
istic for quantum mechanics. In what follows, this structure will assume 
a definite meaning on the basis of definite examples. Here it has been 


given in its most general form, which also allows us to follow its internal 


logic. We emphasize that the basis of this structure is the idea of quantum- 


mechanical interference (superposition) and the use of a certain system of 


basic states in terms of which the expansions are carried out. 


11 Measurement in Quantum Mechanics 


The question of carrying out measurements in quantum-mechanical 
systems and an interpretation of the results so obtained is rightly consid- 
ered to be very complicated, requiring even today further research. We 
shall not give a detailed analysis of the problem of quantum-mechanical 
measurements, but shall try to describe a number of fundamental state- 
ments which have been explained clearly, and demonstrate them using 


some examples.® 


Suppose that a microparticle is in a certain state (a |. According to the 
principle of superposition, the state (a | may be expanded in terms of any 


system of basic states, for example, in the {( £; |} system: 


(al = X (al Bi) «Bi (11.1) 
The numbers (a | f; ) in the superposition (11.1) are essentially the am- 
plitudes of the states ( f; | or, more accurately, the amplitudes of the 
probabilities with which the various basic states ( f; | are “represented” in 


the state (a |. 


The reader who has read the previous section is already familiar with 


all this. It is appropriate now to make things more precise. 


Firstly, we shall note that any definite superposition of states in a 
microparticle arises as a result of its interaction with the surroundings. 
The latter may be some macroscopic body (which is either produced 
artificially or is a part of the natural external conditions). This macro- 
scopic body is referred to as the analyzer. The expression (11.1) should 
be interpreted in the following way: as a result of the interaction with 
a particular analyzer (in this case, we may speak of a f-analyzer), a mi- 
croparticle lying in state (a | is transformed into the superposition state 


> (a | i) (Pil. Taking into account the quantum-mechanical nature 


L 
of the superposition state, it may be said that by interacting with the £- 


analyzer,the microparticle in some sense is “transformed” at once into all 


8The problem of measurement in 
quantum mechanics has been dealt with, 
in particular, in Lamb Jr. (1969); Bohm 
(1951); Blokhintsev (1968). 


The Origin of the Superposi- 
tion of States and the Meaning 
of the Amplitudes of States 
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the states ( f; |. Here, the amplitude (a | f;) should be treated as the ampli- 
tude caused by the indicated interaction of the transition («æ | — ( fi |. The 
quantity | la | B;) |? is the probability of finding the microparticle finally in 
just the ( f; | state. 


One can anticipate at least three questions from the reader. 


First question: What do we really mean by the expression “the mi- 
croparticle is at once transformed into all the ( £; | states”? The 
answer to this question will be given below in the subsection “Po- 
tentialities and their realization in the measuring process”. Here, 
we just remark that although the microparticle is “simultaneously 
transformed into all the (( f; |states” as a result of its interaction 
with the /-analyzer, it can be found in principle each time in only 
one f-state. Hence it may be said that no confusion arises. By the 
way, the reader has already come across a similar situation when 
considering Experiments 1 and 2 in Section 7. Here we once again 
remind the reader that quantum-mechanical logic does not always 


have a corresponding graphic model. 


Second question: If the amplitude of state is really the transition ampli- 
tude, what about the definitions of amplitudes of states given in 
Section 10? Answering this question we recall the following defi- 
nition given in Section 10: “(a | p} is the probability amplitude of 
a microparticle existing in state ( æ | being also registered in state 
( B |”. In this definition the word “existing” should be replaced by 
a more accurate word “existed”, since after interaction with the 
analyzer the microparticle no longer exists in the state (a |. Now 
the word “also” becomes redundant. The definition then assumes 
the following form: “(a | J } is the amplitude of probability of a mi- 
croparticle that existed in state ( æ | being registered in state ( p |? 
Registering is a kind of measuring process and (a | f } plays the 
role of the amplitude of the transition («| — (a |. which takes 
place in this process. Here we remark that in the above-mentioned 
interaction of the microparticle with the analyzer is just one part 
of the measuring process. A more detailed treatment of the measur- 


ing process will be given at a later stage. 


Third question: It was agreed earlier (see Section 8) to read the tran- 
sition amplitudes from right to left. If (a | p; ) is also transition 
amplitude, it should also be read in the reverse direction (from left 
to right). Isn’t it confusing? Actually, if we strictly follow the con- 
dition of writing the preceding states to the right of the ones that 
follow, then (11.1) should be written as 


DlA) (Bila) =la) (11.1a) 
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However, such a notation, as a rule, is not used. Hence we decided to 
allow some inconsistency, and in order to avoid possible confusion in this 
connection we shall in future retain the term amplitude of state along 
with the term transition amplitude. When using both these terms the 
reader must remember that from the point of view of physical meaning, 
the amplitude of state is nothing but the transition amplitude (this has 
already been shown from a mathematical point of view in the previous 


section). 


The reader has in fact already come across analyzers each time the 
interference of transition amplitudes was considered. We shall give a few 


examples. 


First example: [see (9.3)] - the analyzer is a screen with two slits. It 


gives rise to the superposition 


(s| =(s|A)(A]+(s|B) (BI (11.2) 


Second example: [see (9.35)] - the analyzer is a crystal lattice consisting 
of nuclei of the same type with zero spin. It generates the superpo- 
sition 

N 


(sl=)) (slide (il (11.3) 


t 


Third example: [see (9.39)] - the analyzer is a nonuniform magnetic field 


B; It gives rise to the superposition 


(sl= (sli) Gi (11.4) 


It may be said that an analyzer, generating a certain superposition of 
states, in fact ensures the emergence of indistinguishable alternatives. 
Moreover, the number of alternatives is equal to the number of basic 
states in the given superposition. In the first example this number is 
just equal to two (i.e. to the number of slits in the screen); in the second 
example, it is equal to the number of nuclei in the crystal, while in the 
third example it is equal to the number of spin states (i.e. to 2s + 1, if s is 


the spin of the atom). 


The process of measurement in quantum mechanics consists of three 


successive stages: 


(1) a preparatory stage when the microparticle is “prepared” in a cer- 


tain state ( æ |, considered below as the initial state; 


(2) a working stage in which the “prepared” microparticle interacts 


with a certain analyzer and goes over to the superposition state, and 


Examples of Analyzers 


The Essence of Measuring 


Process 
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(3) a registering stage in which the microparticle is registered in one 
of the basic states forming the superposition. In this stage the 
microparticle interacts with some macroscopic body, capable of 
changing its state under the influence of the microparticle. This 


macroscopic body is called the detector. 


If for the sake of simplicity we do not consider the preparatory stage, 
the abstract “scheme” of the measurement process may be written in the 
following way: 


(31> X (81 Bi) (Bl (Bi (11.5) 


Here the arrow 1 corresponds to the working stage, and the arrow 2 to the 


registering stage. 


The basic elements of the measuring instrument (measuring appara- 
tus) are thus the analyzer and the detector. The role of the analyzer has 
already been explained. We shall now consider the role of the detector. 
Figuratively speaking, its role boils down to “spying” on how the mi- 
croparticle “behaves itself” in the superposition state which was created 
by the analyzer. If we make use of the above examples of analyzers, this 
“spying” could provide an answer to the question: Which slit did a partic- 
ular electron pass through? Which nucleus of the crystal lattice scattered 
a particular neutron? What is the spin state of a particular atom? The 
reader who is familiar with the results of similar “spying” (in particular, 
with the results of the Experiment 2 in Section 7) may foresee that an “in- 
tervention” by the detector leads to the destruction of the superposition 
of states. The detector registers the microparticle each time in one of the 
states which constitute the superposition. This is done at the expense of 
the destruction of superposition. From the point of view of the ideas con- 
sidered in Section 8 and Section 9, this means that the detector converts 
indistinguishable alternatives into distinguishable ones and thus destroys 


the interference of the transition amplitudes. 


We separate from the “scheme” (11.5) the registering stage correspond- 


ing to the interaction of the microparticle with the detector: 
D431 Bi) (Bil > (Bi (11.6) 
i 
It is often said that a “scheme” of the type (11.6) describes a “construction” 


of the superposition s i | to the state / 6; |. This process is also 
perp P 


L 
known as “reduction of the wave packet”. 


Thus, while the analyzer creates a definite superposition of states, the 
detector destroys it by confining this superposition of states to one of the 
states constituting it. It is obvious that if the “scheme” (11.5) is tried on a 


single microparticle, it is difficult to say anything about getting any useful 
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information. It is necessary to repeat the measuring process for a suffi- 
ciently large number of microparticles. In this case the observer may find 
out, firstly, the values of the quantities of the /-set encountered in prac- 
tice, and, secondly, the frequency with which the microparticle is found in 
one f-state or another. This allows us to experimentally determine, firstly, 
the spectrum of the values of the quantities in a B-set and, secondly, the 


probabilities | <s | B;) |’. 


In the first place we note that the process of measurement has a radical 
influence on a microparticle. It is enough to point out that a change in 
the initial state of the microparticle in the measuring process is a circum- 
stance of fundamental importance. It is well known that while carrying 
out measurements with macroscopic bodies it is possible to isolate the 
object to a certain extent from the means of measurement. In quantum 
mechanics this is in principle impossible to do so. In other words, it is 
impossible to neglect the interaction of the microparticle with its sur- 


roundings. 


The “scheme” of the measuring process, and more concretely that 
part which is described by expression (11.6), is a demonstration of the 
existence of the element of chance in the behaviour of a microparticle. 
Indeed, it is impossible to predict unambiguously in which f-state a 


certain microparticle will he finally found. 


The impossibility of a graphic representation of the first stage of the 
process (when the analyzer creates a superposition of states) or the final 
stage (when the detector “confines” this superposition to a single state) 
is. also a specific feature of the quantum-mechanical measuring process. 
Thus, obviously, in the first stage of, the process of measurement it must 
not be assumed that a microparticle is literally “spread” over various 
states of superposition (for example, that it passes partially through one 
slit and partially through the other in the well-known two-slit experi- 
ment). Similarly, it is impossible to assume that as soon as a detector 
is activated (as soon as the act of registering occurs) the microparticle 
which is spread over various states suddenly assembles itself in one of 
these states. It should be noted that the treatment of final stage of the 
measuring process gave rise to many arguments. Supporters of a model 
(classical) interpretation of microparticles and microphenomena naturally 
were at a loss when they tried to visualize a “reduction of wave packets”. 
Inasmuch as the states making up a superposition may be spread in space, 
in this case the “reduction of a wave packet” should actually indicate a 
momentary spatial localization of the microparticle. In particular, the 
following example was extensively used. A wave packet interacts with a 
semitransparent mirror (the mirror plays the role of analyzer), is partially 


reflected and partially transmitted (which corresponds to a spreading of 


Some Peculiarities of the 
Quantum-Mechanical Mea- 


suring Process 
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the microparticle into the two states comprising the superposition). Detec- 
tors are placed in the path of each of the wave packets. It is known that 
each time only one detector is activated. Suppose that at a certain instant 
the detector placed in the path of the reflected part of the wave packet is 
activated. This means that the other part of the wave packet momentarily 
disappears from that part of the space where the unactivated detector is 
placed, and reappears in front of the second detector the moment before 
the act of registration. The absurdity of such “behaviour” of the micropar- 
ticle, which, by the way, “cannot know” which detector is activated in a 


given case, is quite obvious. 


Persisting in the efforts to retain the classical interpretation, attempts 
are sometimes made to turn to the classical interpretation of superposi- 
tion. Such an interpretation assumes that the microparticle after interac- 
tion with the analyzer, will actually appear in one of the basic states. The 
role of the detector is simply reduced to revealing the accomplished fact, 
i.e. in which basic state the microparticle appears upon interaction with 
the analyzer. Now everything appears very easy: activation of the detec- 
tor in the example cited above reveals the fact that the given wave packet 


upon interaction with the mirror was reflected and not transmitted. 


However, such a course has already been precluded. It is enough to 
remind the reader of the argument put forward in the discussion of Ex- 
periment 1 in Section 7. If it were possible to assume that a microparti- 
cle passes through one slit in the screen in some cases and through the 
other in some other cases, then, as has been remarked earlier, we would 
not have observed the interference pattern on the detector screen. In 
other words, the classical concept of superposition of states is equiva- 
lent to a destruction of superposition. It must be admitted that until the 
detector registers the microparticle, the latter exists in the supposition 
state 2 (s| £i} (fi | and not in any of the states ( p; |. As to the ques- 
tion of how this can be visually represented, we simply have to give 
up looking for an answer. It may be said that a discussion of quantum- 
mechanical measuring process (especially the problem of “reduction of a 
wave packet”) gives a specially convincing proof of fundamental impossi- 


bility of a classical interpretation of the microparticle. 


A detector in a quantum-mechanical measuring instrument is, as a rule, Comments on Detectors 
a macroscopic system in a state which is so unstable that a microscopic 
influence, or the influence of the microparticle, is enough to change it. In 
order to be “seen” by an observer, the - microparticle causes a complete 


“catastrophe”, an “explosion” on the scale of microphenomena. 


Examples of such “catastrophes” include events like the formation of 
a droplet in Wilson’s cloud chamber, or a bubble in a bubble chamber, 


chemical processes involving photo-emulsion grains, avalanche processes 
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of generation of secondary elections in a photo-multiplier, etc. The “ob- 
servable” microparticle might perish in the “catastrophe” caused by it (as, 
for example, in the case of the registration of a photon in a photomul- 
tiplier or of an electron on a detector screen). However, an even more 
interesting situation is possible when a microparticle “entrusts” the task 
of causing the catastrophe to other microparticles. Thus, for example, 

in Wilson’s chamber, the electron “under observation” creates various 
ions in its wake each of which in turn serves as a centre for the conden- 
sation of the supersaturated vapour filling the chamber. It is these ions 
that cause the “catastrophes” which the observer sees as mist droplets. A 
multitude of such droplets forms the track which is left by the electron 
“under observation”. We emphasize that an electron track is nothing but 
a multitude of successively occurring microscopic events, a totality of 


“catastrophes” on the level of microphenomena. 


When discussing the idea of quantum-mechanical principle of superpo- 
sition we come to a situation which is analogous to the one encountered 
when discussing the idea of wave-particle duality (see Section 5). In both 
cases, a visual (classical) interpretation is not possible. In both cases we 
come to a problem connected with potentialities and with their realiza- 


tion. 


The possibility and the actuality are the well-known categories of mate- 
rialistic dialectics. The contradiction that exists between them disappears 
every time a possibility is realized in one way or the other. Every particu- 
lar situation is characterized by a set of possibilities out of which only one 
is realized. The realization process is irreversible; as soon as it is accom- 
plished, the initial situation qualitatively changes (one of the possibilities 
is realized at the expense of all the other possibilities). The possibility that 
has been realized corresponds to a new situation with new possibilities. 
The process of resolving the contradictions between the possible and the 


actual thus turns out to be endless. 


In classical mechanics (as also in all theories of dynamic type) the 
problem of distinguishing between the possible and the actual does not 
arise because of the absence of the elements of chance in such theories. 
This problem arises in theories of statistical type. The basically statistical 
nature of quantum mechanics (where, as has been mentioned earlier, an 
element of chance is present in the behaviour of an individual object) 
indicates the importance of the problem of the possible and the actual 


when considering microparticles and microphenomena. 


It is from this point of view of resolution of contradiction between 
the possible and the actual that the measuring process in quantum me- 
chanics should be considered. The presence of a microparticle in the 


superposition state corresponds to a situation where the microparticle is 


Potentialities and Their Re- 
alization in the Measuring 
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characterized by a definite set of possibilities. The quantum-mechanical 
principle of superposition of states should be interpreted in this way only. 
In the interaction process of a microparticle with the detector [in the 
process (11.6)], the above-mentioned resolution of distinction between 
the possible and the actual takes place, the superposition of probabilities 
is destroyed and is replaced by one of the alternatives realized. It may 

be said that the “formula” of measuring process [formula (11.6)] is the 
mathematical expression for the process in which the dialectical contra- 
diction between the possible and the actual is resolved. The nature of this 


resolution is that of an irreversible and uncontrollable jump. 


Before concluding this section, it should be stressed that the problems 
of the interference of amplitudes, the superposition of states and of mea- 
surement are not isolated from one another, but form a single entity. Tak- 
ing this into consideration, we recommend that the reader should pause a 
little, go back and read through Section 7-11 once again, in order to get a 


general idea about the physical foundations of quantum mechanics. 
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Interlude: Are These the Same Waves?, Or, Again on Waves in 


Quantum Mechanics 


Call it then, what thou wilt-- 
Call it bliss! Heart! Love! God! 
I have no name to give it! 
Feeling is all in all: 

The Name is sound and smoke, 


Obscuring Heaven's clear glow. 


Goethe (Faust) 


READER: I would like to return to the question of classical superpo- 
sition. Sometimes I get a seditious idea: what is wrong with such a 
superposition? In any case it explains the interference effect more 
clearly than the superposition of amplitudes. I have due regard for 
the structure of quantum-mechanical concepts based on working 
with probability amplitudes, but still this idea keeps on haunting me 


from time to time. 


AUTHOR: Classical superposition certainly does explain interference. 
But what about the result of the experiment involving “spying” on 
the behaviour of a microparticle in the interferometer (for example, 
in the case of a screen with slits)? The experiment indicates a de- 
struction of interference. Classical interference does not explain this 


destruction. 


READER: But suppose it were possible to give a classical interpreta- 


tions of the destruction of the interference pattern? 


AUTHOR: No, such an interpretation is impossible in principle. This 
inference can be drawn from a consideration of the problem of 
the “reduction of a wave packet”. A classical interpretation of the 
microparticle is impossible, and hence a classical interpretation of 
the interference observed in experiments with microparticles is also 


excluded. 


READER: This means that the de Broglie waves have nothing in 


common with classical waves? 
AUTHOR: At least they are not classical waves. 


READER: Consequently, the diffraction of electrons is also not associ- 


ated with classical wave processes? 


AUTHOR: Of course, it is not. 


Participants: the Author and 
the Reader. 
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READER: In that case it is quite surprising that one so often comes 
across the term “wave” in quantum mechanics: “wave-particle 


duality”, “de Broglie waves”, “wave function”, “wave equation”, etc. 


Even quantum mechanics itself is sometimes called wave mechanics. 


AUTHOR: Terminology is created under definite historical circum- 
stances; it is not always quite correct. It is possible that the term 
“wave” is used too often and even without justification when con- 
sidering microphenomena. We have become accustomed to treating 
interference as a specific wave phenomenon. Hence it is not surpris- 
ing that when in 1927 experimenters observed interference patterns 
in experiments involving electrons, they immediately started using 
the term “electron waves”. Actually electron interference has a more 
“subtle” origin; it arises because, generally speaking, the probability 
laws of nature don’t follow the law of addition of probabilities, but 
require the summing of probability amplitudes. At first, of course, 
this was not known. And meanwhile the rather erroneous terminol- 


ogy took root in the literature. 


READER: And what about wave quantities such as the electron wave 
vector or its wavelength? They also appear in mathematical expres- 


sions. 


AUTHOR: They are wave characteristics of a microparticle, but are by 


no means parameters of any classical wave. 


READER: But we also have the classical light waves, or, in other 
words, photon waves. Aren’t the wave vector and the wavelength 


the parameters of a wave in this case also? 


AUTHOR: Here we have a qualitatively different situation. You have 
raised quite an important question. We recall that photons are 
bosons and hence have a tendency to densely “populate” states. Sup- 
pose we are considering a photon state characterized by the quan- 
tities k, w, a (æ is the photon polarization). This state is called the 
ka-state. Suppose there is only one photon in this state. We don’t 
have any classical wave yet. But suppose this state is “populated” 
by more and more photons. In the limiting case of a sufficiently 
large number of photons, we shall get a classical light wave whose 
parameters coincide with the characteristics of the photon state. We 
can now speak of the parameters of the classical wave, i.e. of the 
wave vector k, frequency w, polarization a. A peculiar “transforma- 
tion” of the wave characteristics of a photon into the parameters of 
a wave has taken place. It must he emphasized that this is the result 
of an accumulation of photons in one state. The classical wave has 


been generated as a collective effect. In the case of electrons, how- 
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ever, such a collective effect is in principle impossible. Electrons are 
fermions and hence can only “populate” states one at a time. Conse- 
quently, classical electronic (and in general fermionic) waves simply 


can not exist. 


READER: I think now I understand the position of waves in quantum 
mechanics. But if I am not mistaken, classical interference (classical 


superposition) may take place in boson ensembles. 


AUTHOR: You are right. By the way, that is the reason why the in- 
terference of light was discovered long before the interference of 


electrons. 


READER: It turns out that there are two kinds of interference phe- 
nomena in nature; classical interference resulting from a summation 
of waves, and quantum-mechanical interference where the probabil- 


ity amplitudes are summed. 


AUTHOR: In normal conditions, the classical interference (if we are 
talking of an ensemble of bosons) “masks” the quantum-mechanical 
interference. But in the case of electrons, for example, this “mask- 


ing” does not take place. In this respect we have a “pure” situation. 


READER: Since any mention of waves in quantum mechanics is valid 
only for bosons and that too only when they “populate” the states 
fairly densely, don’t you think it would be worthwhile to water 


down the “wave terminology” a bit? 


AUTHOR: I would prefer not to uproot the established terminol- 
ogy. First of all, it is not the terminology itself that is important 
but rather the meaning concealed behind it. However, if there is a 
choice, then one must certainly choose the most appropriate term. 
That is why we are using here the term “probability amplitude” in- 
stead of “wave function”. The former is more appropriate, though 


the latter is used more frequently. 


READER: Ihave already pointed out that so far, you have said noth- 
ing about the wave function, although it is widely used in all books 


on quantum mechanics. 


AUTHOR: For the same reason we shall also introduce the wave func- 
tion at a later stage. In fact we have already introduced it since the 
“wave function” and “probability amplitude” are terms describing 
the same thing. However, when speaking of the physical founda- 
tions of quantum mechanics, it is better to use the term “probability 
amplitude”. We can go over to the wave function when we consider 


the mathematical apparatus of the theory. 
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In conclusion I would like to stress that it is not the terms that are 
important, but the way they are used. We can use the term “wave” 
for a microparticle, but we must not forget its specific nature. Here 
it is worthwhile recalling the remark made in Section 5 about the 
impropriety of using the model of a classical wave in a resonator for 


a bound electron. 


12 Causality in Quantum Mechanics 


The impossibility of predicting exactly how a certain microparticle 
will behave in the measuring process at one time gave rise to talk of “in- 
determinacy” in quantum mechanics. In particular, it was suggested that 
causality is absent in microphenomena and in its place the uncontrollable 
event reigns. In time these ideas were replaced by a deeper understand- 
ing of the specific nature of time development of processes involving 
microparticles. The complaints about the indeterminacy of quantum 
mechanics proved groundless. It was realized that causality in microphe- 
nomena certainly is present although it is considerably different from the 


classical determinism which is characteristic for “old physics.” 


In quantum mechanics the principle of causality refers to the possibil- 
ities of the realization of events (properties). In other words, in quantum 
mechanics it is not individually realized events that are causally related, 
but only the possibilities of the realization of these events. This is the 
essence of the quantum-mechanical meaning of causality. As Pauli stated 


in his Nobel lecture, 


the statements of quantum mechanics are dealing only with possibilities, 
not with actualities. They have the form, ‘This is not possible’, or “Either 
this or that is possible’, but they can never say, that will actually happen 


then and there. 


Considering the specific nature of the quantum-mechanical meaning of 
causality, we stress the difference between possible and realized events in 
quantum mechanics. We also emphasize the objective nature of possibil- 
ities determined by the properties of the microparticles and the external 


conditions. 


Since possible and actually realized events are identical in classical me- 
chanics, it is clear that upon a transition from the quantum to the classical 
description of the world, the causal relation between the possible events 
must be converted into causal relation between realized events. In this 
sense the quantum-mechanical principle of causality is a generalization of 
the principle of classical determinism - it turns into the latter when going 


over from microphenomena to macrophenomena. 


One may ask: If in quantum mechanics it is not the realized events but 


The Specific Nature of the 
Quantum-Mechanical Concept 


of Causality 


The Manifestation of Causality 


in Microphenomena 
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rather the possibilities of their realization that are causally related, then 
how can the observer make use of such a causality? In an experiment one 


always has to deal with events that have been realized. 


The answer to this question is as follows: the observer must repeat a 
set of similar acts of measurements (for this he must have a sufficiently 
large number of similar microparticles, ensuring each time the same ex- 
ternal conditions for each of them). In each act of measurement a random 
value of the quantity being , measured will be realized. However, a set of 
these random values will allow us to find the law of distribution and the 
mean value of the quantity, which can be predicted a priori. The existence 
of a causal relationship between possible events is expressed through the 


possibility of such predictions. 


We shall consider an example. Let the quantities of the f-set be mea- 
sured in the (æ | state. By carrying out a large number of identical mea- 
surements, the observer gets a whole set of values f1, f2, 63,.... Having 


got N values, the mean value may be determined: 


1 N 
(B)= nuh (12.1) 


Suppose that the observer decides to repeat his observations the next day 
(or the next year). He will get a certain set of values i, b3, f3,.... The 
new set of values will be different from the old set, yet the new mean 
value determined by a formula of the type (12.1) will be close to the mean 
value ( f } obtained earlier (provided, of course, that N is sufficiently 
large). This means that there was no need for the observer to toil on 

the second day. The mean value could be predicted on the basis of the 


previous day’s measurements. 


Moreover, the value ( 8) could have been computed beforehand with- 
out carrying out any measurement. True, for this the observer would have 


to have known the amplitudes ( æ | f; ) in the superposition 
(al =D) (al Bi) (Bil (12.2) 
For computing (8) we must use the relation 


(B)= >) Bilal Bi)? (12.3) 


We assume that the reader has already understood the crux of the prob- 
lem. This lies in the fact that the causal relationship among possible 
events signifies a causal relationship among the probabilities of the real- 
ization of these events. In short, prediction in quantum mechanics has 

a probability character! In order to predict the quantity ( f ) in the state 


(a |, we must know the probability | ( a|f; ) |? of the realization of values 
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f; in the given state. But if these probabilities are not known beforehand 
one must collect the corresponding statistics of the measurements of 


f-quantities which allow us to find the required probability. 
Summing up the remarks made above, we quote Fock’s words: 


The probability of a given behaviour of an object in given external condi- 
tions is determined by the internal properties of the individual object and 
by the nature of these external conditions. It is a number characterizing the 
potential possibilities of this or that behaviour of the object. This possibil- 
ity manifests itself in the frequency of occurrence of the given behaviour of 
the object; the relative frequency is its numerical measure. The probability 
thus belongs to the individual object and characterizes its potential possi- 
bilities; at the same time, to determine its numerical value from experiment 
one must have the statistics of realization of these possibilities, so that 


many repetitions of the experiment are necessary. 


It has already been remarked in Section 6 when discussing the prob- 


lem of necessity and chance in microphenomena that quantum mechanics 


is a statistical theory. Hence, the question of causality in quantum mechan- 


ics may be seen first of all from the point of view of the manifestation of 
causality in general in statistical theories (as statistical mechanics, physi- 


cal kinetics, microscopic electrodynamics). 


In any statistical theory necessity and chance appear as dialectical 
categories. The relationship between them is that of unity and conflict of 
opposites. The velocities of molecules in a gas are random (any instan- 
taneous velocity distribution of molecules is random) but the average 
velocity of a molecule can always be determined. The incidence of any 
electron passing through a screen with slits at any particular point of 
the detector screen is random, but the resulting interference pattern can 
always be determined. Necessity, like chance, is present. in every statis- 
tical theory. It expresses causal relationships. Consequently, causality in 


statistical theories leads to necessity in these theories. 


While in dynamic theories necessity dominates, thus making exact 
predictions of the values of physical quantities possible, it appears in sta- 
tistical theories through distribution laws which give only the probabilities 


of various values of physical quantities. 


As an additional (quite important) manifestation of necessity in sta- 
tistical theories, we mention the conservation laws. It is well known that 
conservation laws may be seen as exclusion principles. From this point of 


view, the role of conservation laws in statistical theories is fairly obvious: 


they serve as conditions which reduce the probability of definite processes 


to zero. 


Causality in Statistical Theo- 


ries 


Statistical nature of quantum 


mechanics 
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When treating quantum mechanics as a statistical theory, one must 
remember that it occupies a special place among these theories. Within 
the framework of classical physics the laws describing the behaviour of 
large number of objects are of statistical nature, while the laws relating 
to the behaviour of an individual object are dynamic. By considering the 
element of chance in the behaviour of a single object, quantum mechanics 
places itself in a special position — that of a statistical theory of an individ- 
ual object. That is why we earlier called quantum mechanics a statistical 


theory in principle. 


This circumstance predetermines the specific nature of statistical 


ensembles in quantum mechanics. 
According to Fock a 


the elements of statistical collectives considered in quantum mechanics, are 
not the microobjects themselves, but the results of experiments with them, 


a definite experimental arrangement corresponding to a definite collective. 


Previously we considered an example in which measurements of quan- 
tities in the J-set were carried out on a microparticle in the state ( æ |. A 
large number of repeated measurements gives a certain set of numbers 
Pi; Be, P3»... This set of numbers is an example of statistical ensemble in 
quantum mechanics. Looking at the expression (12.3) we may conclude 
that | (a | 2i X |? plays the role of a distribution function for the given statis- 
tical ensemble. By changing the experiment (for example, by going over 
to the measurement of quantities in the y-set) the observer will be dealing 
with another statistical ensemble y1, y2, y3,.... It follows from this that it 
is in fact possible to have several statistical ensembles corresponding to a 


single microparticle. 


Classical and quantum-mechanical ensembles, as can be easily seen, 
are different in nature. In classical physics a statistical ensemble is made 
up of a set of many objects, while in quantum mechanics, it is the set of 
the many possible ways of realizing properties of a microparticle, that 
forms an ensemble. Any change in the conditions will lead to a new 
ensemble. In particular, this difference manifests itself in that in classi- 
cal physics averaging is performed over different states of the system, 
whereas in quantum mechanics we speak of mean values in a given state 
in the system (thus, in the example cited above, we spoke about the mean 


value ( f } in the (æ | state). 


Naturally, in quantum theory we also have to carry out averaging 
over different states and consider statistical ensembles formed by a set of 
microparticles. Such problems, however, are beyond the scope of quantum 
mechanics itself and form a separate discipline called quantum statistics. 


Quantum statistics deals with two types of statistical ensembles and in 


Statistical Nature of Quantum 


Mechanics 
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this sense it is a doubly statistical theory. 


Consider the following amplitude of state: Quantum-Mechanical Equa- 
tion Expressing the Principle of 


Cj (t) = (s(t) 1 i) ae) Causality 


This is the probability amplitude of finding a microparticle in its basic 
j-state while it is located in the s-state at time t (at time t the detector is 
activated and thus the microparticle is observed in some particular state). 
The causal relationship among the probabilities of realizing events, which 
is a characteristic property of quantum mechanics, must be manifested 
in the existence of a mutual relationship between amplitudes considered 
at the moments of time t and t + At. Taking into account the principle of 
superposition we express the relationship between amplitudes as a linear 
equation 

C; (t + At) = )) Uy (t,At) C; (t) (12.5) 

i 


(On the right-hand side the sum is taken over all the basic states.) The 
expansion coefficients Uj; (t, At) for small values of At will be represented 
in the form 

U; (t+ At) = 6; + =H; (0) At (12.6) 


This representation is justified by the fact that for At — 0 the coefficient 
Uj; will obviously be transformed into 6;; (6;; is the Kronecker delta 


symbol). 


Substituting (12.6) into (12.5), we get 


A Ci (t + At) — Ci (t) 
=j 
At 





= XHA C; (t) (12.7) 
J 
In the limit as At — 0 we find the equation 
a 
-iig Ci(t) = 2 Halo C; (t) (12.8) 


This is the fundamental quantum-mechanical equation representing the 
principle of causality. Here H;; are certain matrix elements describing the 


physics of the problem under consideration. 


The matrix H;; is called the Hamiltonian matrix. We make the follow- Hamiltonian Matrix 


ing remarks concerning the Hamiltonian matrix: 


(1) The time dependence of the Hamiltonian matrix reflects the depen- 
dence of physical conditions on time (for example, a microparticle 
situated in a magnetic field which varies with time). If the condi- 


tions do not change, the matrix does not depend on time. 


(2) If the Hamiltonian matrix is diagonalized (only its diagonal ele- 


ments are non-zero) then in this case the elements of the matrix 
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have a simple physical meaning: they are the possible values of 
the energy of the microparticle!” Because of this the Hamiltonian 


matrix may also be called the energy matrix. 


(3) The elements of the Hamiltonian matrix satisfy the relation 


Hi; = H5; (12.9) 


This relation is associated with the fact that the probability of 
the presence of a microparticle in at least one of the basic states 
obviously cannot change with time (the probabilities z C; C1). In 


order to prove this relation, we shall proceed from the equation 


d 
— y C;Ct=0 12.10 
rea (12.10) 


Taking into account that 


d : dCi dC* 
FUCiCi= Dy z oF 2G a 


and using expression (12.8), we find that equation (12.10) assumes 


>>. (Hu -H,) C;Ci=0 (12.11) 
rj 


from which the result (12.9) follows directly. 








the form 


Finally, we note that from (12.9) one can deduce that the diagonal 
elements of the Hamiltonian matrix are real numbers. This circumstance 
is in conformity with the above-mentioned role of the diagonal elements 


as values of the energy of a microparticle. 


The answer to this question may be given by considering expression 
(12.8) for the basic quantum-mechanical equation. The gist of the answer 
is as follows. Firstly, we must choose a set of basic states { ( i | }; secondly, 
we must find the form of the Hamiltonian matrix considered in the system 
of chosen basic states. After this we can make definite predictions by 


using equation (12.8). 


It is especially simple to consider cases when the number of basic 
states is equal to two. We can further simplify these by assuming that the 
Hamiltonian matrix is invariant with time. In Section 13 and 14 we shall 
consider such cases. The use of equation (12.8) which forms the basis of a 
causal description of microphenomena will be demonstrated by particular 


examples. 


13 Microparticles with Two Basic States 


The number of basic states of a microparticle is usually greater than 


10This remark will be explained later 
(see Section 13). 


What Are the Requirements 
for a Causal Description of 
Phenomena in Quantum The- 


ory? 


Examples of Microparticles 


with Two Basic States 
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two. However, there are situations when it is possible to consider only 
two basic states. Taking an example already familiar to the reader we 
shall consider the passage of a photon through a polarizer. The photon is 
in a state with definite momentum hk (and definite energy ike). In this 
case we shall only consider the possible changes in the polarization of 
the photon. Hence, one may speak of just two basic states of the photon, 


having in mind its polarization states. 


So, every time when we talk about a microparticle with two basic 
states, it is assumed that we only consider the possible changes in some 
particular “parameter” of the microparticle (for example, its polarization). 
All the other “parameters” are assumed to be known. We shall now cite 
some examples where one can speak of a microparticle with two basic 


states. These states are denoted by (1 | and ( 2 |, respectively. 


The ammonia molecule consists of one nitrogen atom and three hydro- 
gen atoms. The nitrogen atom does not lie in the plane passing through 
the hydrogen atoms (for brevity, we shall call this plane the H-plane). The 
state ( 1 | corresponds to the nitrogen atom being on one side of the H- 
plane, and the state ( 2 | corresponds to this atom being on the other side 


of the H-plane. 


The hydrogen molecule is made up of two protons and two-electrons, 
the spin states of the latter being different. Suppose we single out one of 
these spin states. Then the basic state ( 1 | may be defined as the state of 
the molecule in which the electron with the singled out spin is localized 
near one proton, and the state ( 2 | as the state with the localization of this 
electron near the other proton. Moreover, the second electron is localized 
each time near the corresponding “vacant” proton (the possibility of the 
localization of both electrons near one proton need not be considered in 


view of the strong Coulomb repulsion between electrons). 
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We represent an arbitrary state (s | of a microparticle in the form of a The Role of Non-Diagonal 
superposition of basic states ( 1 | and (2 |: Elements in the Hamiltonian 
Matrix 


(s|=(s]1)(1]4+(s]2)(2| 
or, taking into account relation (12.4), 
(s|=Cy,(1|+C,(2| (13.1) 


According to (12.8), the amplitudes C; and C; satisfy the system of equa- 


tions F 
“AG = Hy, C + Hi2 C2 
7 (13.2) 
“th = Ay, Cy + An C3 


We shall consider two cases: 
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First case. The non-diagonal elements Hız and Hz; are equal to zero (the 
Hamiltonian matrix is diagonalized). In this case the system of 


equations (13.2) splits into two independent equations 


d 
“Hh = Ay Cy 
f (13.3) 


d 
—iħ— C = Hz C 
AE 2 22 C2 


It follows from (13.3) that if at a certain time the microparticle is, 
for example, in the state (1 |, it will never appear in the state ( 2 |. 
Here, the basic states (1| and (2| are in fact the stationary states of 
the microparticle characterized by the energy values Hi; and H22 


respectively. 


Second case. The non-diagonal elements H;2 and Hz; are non-zero. In 
this case, we have a system of two mutually related equations 
[system (13.2)]. Hence if at a given moment the microparticle is, 
for example, in the state ( 1 |, it may appear in the state (2 | at 
another instant of time. The presence of non-diagonal elements in 
the Hamiltonian matrix means the existence of transitions of the 


microparticle within various basic states. 


While considering examples of microparticles with two basic states, 
we note that in the ammonia molecule, the nitrogen atom undergoes tran- 
sitions changing its position with respect to the H-plane. The protons 
in the hydrogen molecule “exchange” electrons. Note that this very “ex- 
change” of the electron pair forms the basis of a chemical bond. It may 
be said that a valency bond between two atoms arises as a result of each 
of these atoms “contributing for common use” one electron. It is these 
“shared” electrons that are responsible for the appearance of a bond be- 


tween the atoms. 
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A Brief Interlude 


-- You understand, and are acquainted with Latin, without doubt? 
-- Yes; but act as if I were not acquainted with it. 


Moliér (The Cit Turned Gentleman) 


READER: Earlier, in Section 10, it was stated that “if an object is in 
one basic state, it cannot be found in another basic state”. Isn’t this 


in contradiction with the possibility just mentioned of transitions 


between basic states? 


AUTHOR: In Section 10, we were talking about the principle of su- 
perposition of states. During the course of the discussion we did 
not take into account the possibility of the development in time of 


transition between the states giving rise to the superposition. 


READER: Please clarify this. 
AUTHOR: First of all, let us expand (13.1) in the following form: 
(s(t)|=Ci(t) (1] + C2(t) (2| (13.1a) 


We have the microparticle in the superposition state ( s (t) |. If the 
detector is activated at time t, the microparticle will be found in the 
state (1| or (2|, the probability of these events being |C; (t) |? and 
| C2(t) |’, respectively. It is significant that one event excludes the 


other - this is what we mean by the statement you quoted. 


READER: I understand the crux of the matter. If the microparticle Is 
found, for example, in the state (1 | at time t, this excludes the possi- 
bility of its being found in the state (2 | at the same time. However, 


the microparticle may be found in the state ( 2 | at another instant of 


time. 


AUTHOR: It is important that the non-diagonal elements of the 
Hamiltonian matrix should be non-zero in this case. Otherwise, 


if you find the microparticle in the state (1 |, you will never find it 


in the state (2 |. 


READER: I understand that the superposition states and transitions 


between basic states are different “subjects”. 


AUTHOR: That is true but don’t forget that when considering transi- 
tions we should take into account not only the non-diagonality of 
the Hamiltonian matrix, but also the superposition relation (13.1a). 
In this case, everything depends on the nature of the time depen- 


dence of the amplitudes C; and C3. In one case it so happens that 
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|C; |? and |C; |? do not vary with time; in this case there are no 
transitions between the basic states. In the other case |C; |? and 


| C3 |? vary with time and then we do have transitions. In short the 
question of the nature of the time dependence of the amplitudes, C4 


and C; requires a more detailed consideration. 
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We shall consider two cases exhibiting the time dependence of the Change of Amplitudes of State 


amplitudes C; and C3. in Time 


First case: The non-diagonal elements of the Hamiltonian matrix are 
equal to zero. By solving equation (13.3), we find 
i Hy, t 
C1 (t) = C1 (0) exp (==) 
m (13.4) 
Co(t) = C(0) exp (=z) 


This is the nature of the dependence on time of the amplitudes 
describing stationary states. 
From (13.4) it can be seen that the probability of finding the mi- 


croparticle in any basic state is invariant in time: 


1C) |? = |C1(0) |? 


(13.5) 
| Co(t) |? = | C2(0) |° 


If at time t = 0 the microparticle is in the state (1 |, then | C1 (t) |? = 
1 and | C2(t) |? = 0. As expected, the lifetime of a microparticle in a 
stationary state turns out to be indefinitely long. 

Second case: The non-diagonal elements of the Hamiltonian matrix are 


different from zero. We assume that we can put!! 11 This can be done if the problem under 
consideration is characterized by a certain 
symmetry; thus in the example with the 
ammonia molecule, the states ( 1 | and (2 | 
correspond to positions of the nitrogen 


Besides, if we assume that non-diagonal elements of the Hamilto- ae a. are symmetrical with respect 
to the H-plane. 


Any = Həz = Eo (13.6) 


nian matrix are real and take (12.9) into account, we can denote 


Hız = Hə; =-A (13.7) 


By using (13.6) and (13.7) we rewrite (13.2) in the form 


d 
=e = Eo Cı ped AC) 
A (13.8) 
—iħ— C; = Eo C2 — AC 
l di? 0&2 1 
This system of equations is equivalent to the following system: 

„d 
“he (Cy + C2) = (Eo — A) (Ci + C2) 


-nt OER = (Fy +A) E, 


The solutions of the latter system are of the form 


i (E 9 — A) t 
Ci Ca = aexp (=) 

i (Eo + A) t 
Ci -Ca = bexp( +A) 
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from which we obtain 








C(t) = 5 ep (A) 5 oe p( ee") (13.9a) 
C(t) = ep2 "| - i exp (ES "| (13.9b) 


Suppose that at t = 0 the microparticle is in the state (1 |. Then 
C,(0) = 1 and C,(0) = 0, from which a = b = 1. In this case 


expressions (13.9a)-(13.9b) assume the form 


C(t) = exp (P "| cos (=) 
C(t) = -iexp (5° "| sin (=) 


From these expressions it is obvious that the probability of the 





(13.10a) 





(13.10b) 


microparticle remaining in the state (1 | at time t is equal to 


At 
| C,(t) |? = cos? (=| (13.11a) 
and the probability of its appearing at time t in the state ( 2 | is 


equal to 


| Co(t) |? = sin? (=) (13.11b) 


It is useful to compare (13.5) and (13.11a)-(13.11b). While in the 
first case the probabilities | C, |? and | C2 |? are invariant in time, 
their dependence on time in the second case is quite obvious. The 
time dependence of the probabilities | C,(t) |? and | C2(t) |? deter- 
mined by relations (13.11a)-(13.11b) is shown in Figure 13.1. 


Let us compare the expressions for Cı + Cz and C; — C2, obtained in the 
previous section, with (13.4). This comparison allows us to conclude that 
the amplitudes C; + Cz and C; — C2 describe the stationary states of the 
microparticle with energies equal to (Ey) — A) and (Eo — A), respectively. 
Further, we introduce a new pair of basic states 

1 

y2 
1 

|= — ((1|+(21) 
V2 


I=- 1-21) 


(13.12) 


(it is easy to see that if the states ( 1 | and (2 | satisfy the orthonormaliza- 
tion condition (10.8), the states (I| and (H | also satisfy this condition). By 


using (13.12), we rewrite (13.1) in the form 


C-C C,+C. 
(s| = —— (|+ — (1 
V2 V2 


It can be seen from here that a transition from the basic states (1 | and 





(13.13) 


{ 2| to the basic states (I| and (II | corresponds to a transition from am- 





/e,(4)/* 
fy leer 





Atli 
C “ý Ln 


Figure 13.1: Probability of microparticle 
remaining in different states as given by 
equations (13.11a)-(13.11b). 


Diagonalization of the Hamil- 


tonian Matrix 
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C-C Cy +C 
plitudes C; and C2 to amplitudes (A=) and | z 2 } Since the 
2 2 
latter describe the stationary states of the microparticle, it follows that the 
transition under consideration is associated with a diagonalization of the 


Hamiltonian matrix. 


Ey -A 
-A Eo 


EgtA 0 
0 Eo- A 














Thus, suppose we have a certain microparticle under certain external 
conditions. Let us choose a certain system of basic states. The attempt is 
usually made to choose basic states which have a clear physical meaning 
(as we did, for example, in the case of the ammonia molecule or the hydro- 
gen molecule). The basic states chosen on the basis of such considerations 
lead, in the general case, to a Hamiltonian matrix whose non-diagonal 
elements are different from zero (the microparticle undergoes transitions 
between the basic states). Further, it is possible to go over to a new sys- 
tem of basic states for which the Hamiltonian matrix is diagonal. The new 
basic states describe the stationary states of the microparticle; the ele- 
ments of the diagonalized Hamiltonian matrix are essentially the values of 


the energy in these states. 


In the general case the non-diagonal elements of the Hamiltonian 
matrix are different from zero and so the simplifying conditions (13.6) and 
(13.7) are not applicable. In this case one must solve not the simplified 
system of equations (13.8), but the more general system of equations (13.2) 
for a microparticle with two basic states. We suggest that the reader, if 
he so desires, solves the system (13.2) himself, assuming for the sake of 
simplicity that the Hamiltonian matrix is invariant in time!” We shall 


limit ourselves here giving some results. 


The energy of the stationary states of a microparticle is determined by 


the expression 





2 
Hy, +H. Hy, — H: 
oo + Hyp Hy (13.14) 


The basic states (I| and (II | corresponding to these energy values are 
expressed in terms of the elementary states ( 1 | and (2| describing the 


equation system (13.2) in the following manner: 


(I] = ay (1 | +51 (2 | 
(I| = ag (1] + b2(2| 


(13.15) 


General Case 


12Such a solution is given, for example, 
in 

Feynman, R. P., Leighton, R. B., and 
Sands, M. (1965). Lectures on Physics, 
volume 3. Addison-Wesley Reading, Mass 
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where 
Jay? + [bi /? =| a2 |? +| bo? =1 
aq Hız 
b E-H (13.16) 
a Hz 
b> Eq — Hx 


It can be easily seen that if Hı = Hg. = Eo and Hz = Hz = —A, 

the result (13.14) gives E, = Eo + A, the superpositions (13.15) turn 
into (13.12). In other words, we arrive at the simplified case of the non- 
diagonal Hamiltonian matrix discussed above in detail. But if Hı? = Hz; = 
0, the result (13.14) gives Ey = Hy1, En = H22 - we arrive at the case of the 
diagonal Hamiltonian matrix ({I| = (1|, (I| = (2). 


We recall that the basic states = (1|and= (2| of the ammonia Example of the Ammonia 
molecule were chosen using graphic physical considerations: they cor- Molecule 
respond respectively to the position of the nitrogen atom on one side of 
the H-plane and on the other. Since these positions are symmetrical, we 
may take Hj; = H22 = Eo. Assuming further that the elements Hız and 
Hp, are real (Hi2 = H2, = —A), which, as it turns out, does not involve 
the loss of generality in this case, we arrive at the. situation to which the 
simplified system of equation (13.8) corresponds. It follows from this that 
the energy levels of a molecule are essentially Ey + A and Ey — A. We 
emphasize that if no transitions took place between the states (1 | and 
(2|, there would have been only one level Eo in place of the levels Ey) + A 
and Eo — A. It would have been doubly degenerate since there would be 
two states corresponding to it. It may be said that transitions between 
the states (1| and (2 | (associated with “pushing” of the nitrogen atom 
through the H-plane) correspond to a removal of degeneracy, i.e. to a 


splitting of the level Ep into two levels Ey + A and Eo — A. 


Further, let us assume that the ammonia molecule is placed in a static 
electric field with intensity & which is perpendicular to the H-plane. 


Denoting the electric dipole moment of the molecule by d, we write 


Ay, = Eo + Ed 
(13.17) E 
Həz = Eo = éd 
Now the positions of the nitrogen atom on either side of the H-plane are EA 
no longer physically symmetrical (Hi; + H22). Assuming that Hy; = H21 = . > 
—A as before, we write the system of equations (13.2) for the case under E;A 


consideration: 





0 6 


(13.18) Figure 13.2: Qualitative dependence of 


d 
—ih— C = -AC, + (Eo — &d) C2 energy levels of ammonia molecule on the 
dt applied electric field intensity. 


d 
-ih Cy = (Eo + éd) Cı — AC, 


By using (13.14),we obtain the following expressions for the energy levels 
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of the molecule in a static electric field: 


E; = Eo + VA? + &2d2 
Eq = Eo — VA? + &2q2 


(13.19) 


Figure 13.2 shows the qualitative dependence of the energy levels of 
ammonia molecule on the field intensity. It can be easily seen that, the 
effect of “throwing” the nitrogen atom through the H-plane is important 
for relatively small field intensities. In strong fields, when the levels 


diverge considerably, this effect becomes insignificant. 


14 The Electron in a Magnetic Field 


It is well known that the projection of the electron spin momentum 
in any direction may assume only two values (—f/2 and +h/2). This 
allows us to treat the electron as a microparticle with two basic states. 
The problem of an electron in a magnetic field is of great practical interest. 
Besides, this problem is also quite interesting from a methodical point 
of view: its analysis enables one to get acquainted with not only the 
general nature of a system with two basic states, or, in other words, the 
two-level system, through a physically suggestive example, but also with 
the general approach to the analysis of such systems. Let us first fix the 
direction of the coordinate axes (including the direction of the z-axis). For 
the basic states (1| and (2 | we choose states for which the projections of 
the electron spin on the z-axis are equal to —f/2 and +h/2, respectively. 
We switch on a static magnetic field (we denote the value of the magnetic 


induction by B) and consider the following two cases. 


First case: the magnetic field is directed along the z-axis (By = By = 0). 
In this case the basic states ( 1 | and (2 | are stationary; the state 
{1| has energy —y B, and the state (2 | has energy u Bz (p denotes 
the magnetic moment of the electron). The amplitudes C; and C2 


satisfy two independent equations of the type (13.3): 


d 
iħ— Cı = u Bz C 
Hegy 15H 1 


j (14.1) 
—iħ— C; = u B,C 
ap T KEERA 
The Hamiltonian matrix of the electron is of the form 
—uBz 0 
[a] =| (14.2) 
0 pB, 








Second case: the magnetic field is in an arbitrary direction. In the first 
place, we note that irrespective of the direction of the field (in 
other words, irrespective of the choice of the direction of the coor- 
dinate axes) the energy levels of the electron are determined by the 


expressions —yB and uB; while in the previous case we had to take 


The Hamiltonian Matrix for an 


Electron in a Magnetic Field 
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B = B,, in this case we must have B = ,/B2 + B3, + B2. Thus 
Ei = -u| B2 + B? + B? 


(14.3) 
Ey = pB} + By + B? 
Note that 
E= -Er (14.4) 


Further we make use of relation (13.14). By taking (14.4) into account, we 
may assume that H,; + Hz. = 0. As a result, by combining (14.3) and (12.9), 


we get 
(= - H22 


2 
- ) +| Hy [2 = p (B2 + B3 + B2) (14.5) 


We shall assume a linear relationship between the elements of the Hamil- 
tonian matrix and the field. It turns out that this fairly natural assumption 


allows us to derive the following relations from (14.5): 


Hı = —H Bz, 
Hz = p Bz, 
Hz = H} =H (By + iBy) 


Thus, the Hamiltonian matrix of an electron in a magnetic field -has the 


following general form: 


e- 


It can be easily seen that when B, = By = 0, the matrix (14.6), as expected, 


-4 Bz —p (Bx — iBy) 


14.6 
—p (Bx + iBy) pBz g 





coincides with the matrix (14.2). 


By using (14.6) we write the system of equation (13.2) for the case 


under consideration: 


d 
-iñ 5 [B.C + (Bx — iBy)C2| 
i (14.7) 


d 
-ihg =- [(Bx + iBy)C; — BzC2| 


In conclusion, we note that although the above argument was conducted 
for the case when the Hamiltonian matrix is invariant in time, the results 


(14.6) and (14.7) are valid even when the magnetic field varies with time. 


Let the direction of the magnetic field be determined by the polar angle 
0 and azimuth ¢ (Figure 14.1). We shall assume that the electron spin is 
directed along the field; consequently, the electron is in the stationary 
state (I| with energy E; = —yB. According to (13.15), the state (I| can be 


represented as a superposition: 


(I] = a1 (1] + b1 (2 | (14.8) 


Projection Amplitudes 
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where, as we recall, (1 | and (2 | are the states in which the projections 
of the electron spin on the z-axis are equal to +h/2 and —f/2, respectively, 


and the coefficients a; and b, are determined by the relations (13.16): 


ai Hı2 


b Er-Hn (14.9) 
la|? +|b |? =1 


By using (14.6) we write 


Ay, = —p B; = uB cos 0 
(14.10) 
Hız = —p (Bx — iBy) = —pBsin 0 exp(—ig) 


Substituting (14.10) into (14.9), we find that 


a, _ sin exp (—ig) 


bı 1-— cos 0 





Taking into account that | a; |? + | b; |? = 1, we arrive at the following final 


z 0 —iọ 
ay, = COS (5) exp (=) 
bı = sin (5) exp (2) 

2 2 


Thus, we have determined the amplitudes of the probabilities that the 


result: 


(14.11) 


electron has its spin directed along the z-axis (amplitude a1) or in the 
opposite direction (amplitude b1) by assuming that the electron spin is 
directed along the field (i.e. in the direction determined by the angles 0 
and ọ). 


Note that (14.11) does not contain the magnetic induction B. It is ob- 
vious that the result (14.11) must be valid in the limiting case B — 0. In 
other words we may exclude the field from consideration and interpret 
(14.11) in the following way. It is clear that the direction of the electron 
spin is determined by the angles 0 and g. In this case the amplitude of the 
probability that the electron spin is along the z-axis is a, and the ampli- 
tude of the probability that the electron spin is in the opposite direction is 
bı. Expression (14.8) should be treated in this case as an expansion of the 
spin state (0, ọ | in terms of the spin states ( z | and ( —z |: 

(0, | = cos (5) exp (-2} (z|+sin (5) exp (2) (-z| (14.12) 
The amplitudes of the states occurring in (14.12) 


(0, |z} = cos (3) exp (-#) 


; (14.13) 
(0,@|-z) = sin (3) exp (2) 





ay 


Figure 14.1: Direction of the magnetic 
field. 
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are called projection amplitudes. By using projection amplitudes, we may 
predict the result of the following experiment. Let an electron beam, 


polarized in a direction given by the angles 0 and 9, pass through some 


“filter” which only allows through electrons whose spin is along the z-axis. 


In this case the amplitude of probability of an electron passing through 
the apparatus (through the “filter”) is (0, ọ | z). The projection amplitude 
here plays the role of the amplitude of the electron transition from the 


state (0, ọ | to the state (z |. 


Let the direction of the electron spin be given by the angles 0 and g 
(the electron is in the state (0, ọ | ). This state can be represented in the 
form of superposition (14.12) of the states (z | and (—z |. Suppose that at 
time t = 0 we switch on a magnetic field B which is directed along the 
z-axis. Now the states (z | and (—z| become stationary states. Using this, 


we write [see (13.4)] 





—ipB,t 
(0.912) =Ciexp| 5 








nia (14.14) 
iuBz 
(0.1-2) = Crexp| 5 ) 
Comparing (14.14) with (14.13), we conclude that 
0 —i 
Cı (0) = cos (5) exp (=) 
(14.15) 
C2(0) = sin p ex H, 
? 2P |2 
It follows from this that in time t after the magnetic field has been 
switched on the projection amplitudes assume the form 
0 i 2uBzt 
(0, p(t) | z) = cos (5) exp (-5 l + 7; )) 
(14.16) 





(0, p(t) |-z) = sin (5) exp (5 l + arat) 

Thus, the switching on of a magnetic field along the z-axis does not 
change the polar angle 0 but changes the azimuth g, the change in g 
being proportional to the time interval t which elapses after switching 
on the field. This means that the spin of an electron precesses around the 
z-axis (around the direction of magnetic field) with a constant angular 
velocity. It can be easily seen that the angular velocity of the spin preces- 
sion is given by the relation 

2B, 
hi 


(a) 





(14.17) 


We move one step further by ignoring coordinate axes. Suppose that 
at time t = 0 the direction of electron spin forms an angle 0 with the 
direction of the magnetic field. This angle will remain constant with 


time, but the electron spin will precess around the field direction with an 


Precession of the Electron Spin 
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angular velocity 
2B 
y= HB 


h 
We further suppose that the magnetic field is varying with time (in the 


(14.17a) 


general case, both the direction and the magnitude of the vector Bare 
varying). A change in the field leads to a corresponding change in the 
electron spin precession: a change in the magnitude of the magnetic field 
results in a change in the angular velocity of precession, while a change 
in the direction of the field causes a change in the direction around which 


the precession takes place. 


A knowledge of the precession enables one to predict changes in the 





electronic state over a given period of time. We shall consider a simple ex- x 
ample (Figure 14.2). Suppose that at the instant t = 0 the field is directed 

along the z-axis [vector B (0)], and the electron spin is in the zy-plane Figure 14.2: The precession of the electron 
and forms an angle 0o with the direction of the field. Thus the initial state ERA E 

of the electron (0(0), ¢(0}) is determined by the angles 0(0) = 0) and 

(0) = 2/2. We shall assume that the magnitude of the field does not 

change with time, but that the direction of the field changes. Suppose 

after time tı = 2f/2yB the field becomes directed along the y-axis [vector 

B (tı) in Figure 14.2]. What will be the state of the electron at the t,? It is 

clear that if the field direction had remained unchanged, the end point of 

the electron spin vector, while precessing around the z-axis, would have 

described a semicircle during the time zfi/2y B, thus shifting from point 

sı to point sz. If we take into account the variation in field direction, we 

shall find that it has shifted not to the point sz but to the point s3. Con- 

sequently, the required state of the electron (0 (t1), o (t1) | ) would be 


determined by the angles 


A(t) = =- % 


WINDS 


p(t) = 


According to Feynman’, Generalization of the Problem 


of an Electron in a Magnetic 
it is interesting that the mathematical ideas we have just gone over for the 


Field to an Arbitrary Two-Level 


spinning electron in a magnetic field can be applied to any two-state sys- 


System 


13 Feynman, R. P., Leighton, R. B., and 
electron, any problem about two-state systems can be solved by pure geom- Sands, M. (1965). Lectures on Physics. 


etry ... If we can solve the electron problem in general, we have solved all volume 3. Addison-Wesley Reading, Mass 


tem. That means that by making a mathematical analogy to the spinning 


two-state problems. 
Let us clarify these remarks. 


We shall consider some two-level system with the basic states ( 1 | and 
(2|. We assign some vector to each state of the microparticle. A choice 
of the basic states ( 1 | and (2 | in this case is equivalent to a choice of the 


z-axis (as if these two states correspond to the two z-projections of the 
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electron spin). Suppose that the microparticle is in the state (s(0) | at the 
initial moment of time. We assign to this state a vector whose direction is 


determined by the angles 0(0) and ¢(0): 


{s(0) | = (8(0), (0) | (14.18) 


In order to find the angles 0(0) and ~(0) we must expand the state (s(0) | 
in terms of the basic states ( 1 | and ( 2| and use for the coefficients of ex- 
pansion expression (14.13) for the projection amplitudes. This expansion 


is of the form 


(s(0) | = cos (©) ap (-22) Fiesa (©) - (22) A 
(14.19) 


Further, let us turn to the Hamiltonian matrix of the microparticle. First 








of all we shift the zero point of the energy in such a way that it is located 
precisely half-way between the two energy levels or, in other words, so 


that the condition (14.4) is satisfied. In this case 
Hy, + Ho. = 0 (14.20) 


We now formally introduce a vector p B (itis in no way associated with 
any magnetic field!) such that its projections on the coordinate axes 
(remember that the axes are determined by the choice of the basic states) 


satisfy the conditions 


Ay, = -4 Bz 
(14.21) 
Hz = -4 (Bx ~ iBy) 
By using (14.21) we determine 
2pt,/B2 + Bi + B? 
ee (14.22) 


h 


In order to solve this two-level problem, i.e. in order to determine the 
change in the (s | state during certain time t, we must consider the preces- 
sion of the vector ( 6, g | around the direction B with an angular velocity 
w. If the Hamiltonian matrix is time dependent, both the direction of 

B and the magnitude of the angular velocity of precession will change 
accordingly. After a time t, the microparticle will be in the state (s(t) | 
defined by the angles 0 (t) and g(t), which may be determined if we know 
the initial values 0 (0) and ọ (0) and the precession (in exactly the same 
way as was done in the example given at the end of the preceding subsec- 
tion). The transition from the angles 0 (t) and ọ (t) to the required final 


state (s(t) | is accomplished by using the familiar superposition: 


| exp (-28) (1|+sin (2) exp (22) (2| 
(14.23) 











(s(t) | = cos | 
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Applying these remarks to the example of the ammonia molecule con- 
sidered in Section 13, we must shift the zero point of the energy by an 
amount Ep, and go over (for the sake of convenience) from the basic states 
{1| and (2| to the basic states (I| and (I|. As a result, we get in place of 
(13.18) the following system of equations: 


d 
ha = -AC;+ EdCy 


y (14.24) 
“th Cn = dC, — ACy 


This system is convenient for drawing the analogy with the electron in 

a magnetic field. Comparing (14.24) and (14.7), we find that the quantity 
A corresponds to —y B, and the quantity &d to -u By: Consequently, we 
have to consider the precession of the vector describing the state (s| of 
the molecule in the “magnetic field” which is made up of two components: 
a constant component along the z-axis, associated with the effect of the 
“throwing” of a nitrogen atom through the H-plane, and a component 
along the z-axis, associated with the electric field. The latter component 


may, obviously, vary with time. 


PAULI SPIN MATRICES. In conclusion, we shall mention the Pauli spin matrices which are 


widely used in the quantum mechanics of two-level systems. These matrices are of the form 


aal i a= | = 0 
1 0 i 0 0 -1 





By using these matrices, we can rewrite the expression for the elements of Hamiltonian matrix of all 


electron in a magnetic fiald (14.6) in the following form: 
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(14.25) 


Hij = -4 (of, By + of! By + 07, Bz) (14.26) 


By considering o*, o”, o as components of some matrix vector G. we can rewrite (14.26) in a form 


which is independent of the choice of the coordinate axes: 


Hij = -p õi; B (14.27) 


Pauli spin matrices are useful because any second-order matrix (in particular, the Hamiltonian matrix 


of any microparticle with two basic states) may be represented as a superposition of these matrices. 


The Pauli spin matrices introduced for an electron in a magnetic field have proved to be convenient for 


considering a wide range of two-level problems. This is not surprising when we consider the possibility, 


discussed above, of generalizing the problem of an electron in a magnetic field to arbitrary two-level 


systems. 


15 The Wave Function 


At the very outset, let us make it clear that this section hardly adds 
anything new to the physical foundations of quantum mechanics consid- 
ered in this chapter. In fact, we are already able to summarize the physical 


aspects of the theory and go over to its mathematical side, based on the 
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use of linear operators. However, before doing so, it is worthwhile to in- 
troduce the concept of the wave function. Wave functions are widely used 
in the existing literature on quantum mechanics; hence it is important 
that the reader should be aware of the “position” of the wave function in 
the above description of amplitude concepts. Until now, the wave func- 
tion, being essentially the amplitude of state, existed in this picture in an 
implicit form; we shall now make it explicit. Besides, one must remember 
that while changing over to the mathematical apparatus of quantum me- 
chanics, it is more convenient to use the wave function rather than the 


amplitude of state. 


Let (x | be the state of a microparticle corresponding to its localization The Wave Function as the 
at a point in space with the coordinate x (for simplicity we consider the Amplitude of State 
one-dimensional case). Then (s |x ) may be considered as the probability 


amplitude that a microparticle in the state (s | has the coordinate x. 


However, we must make this a little more precise. When considering 
the probability of any spatial localization of a microparticle, we must take 
into account the continuity in the variation of a spatial coordinate. Hence, 
instead of the probability of finding a microparticle precisely at the point 
x, we must consider the probability of finding it on the interval from x to 


x + dx. Denoting this probability by dw, (x), we have 
dw,(x) =| (s|x) |?dx (15.1) 


Consequently, the quantity (s |x) is, strictly speaking, not the amplitude 
of probability, but the amplitude of probability density. 


In literature, the quantity (s |x) is referred to as the wave function and 


is expressed, for example, by (x). Thus, 
Ws(x) = (s |x) (15.2) 
Using (15.2) we can rewrite (15.1) as 
dws(x) = | Ys(x) |'dx (15.3) 


It follows from (15.3) that | W(x) |? is the probability density of finding a 


microparticle with the state (s | at the point x. 


From a mathematical point of view, the wave function in y,(x) is a 
parametric function, the parameters being the quantities which are pre- 
cisely defined in the state (s |. Taking into account the earlier remarks 
on the structure of amplitudes of states, we may state that the quantities 
of one complete set serve as the argument for the wave function, its pa- 
rameter being the quantities of another set. It is often said that the wave 
function W(x) is an eigenfunction of the quantities of the s-set, given in 


the representation determined by the quantities of the x-set (or simply, in 
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the x-representation). 


Wave functions are frequently used in practice in the x-representation 
(coordinate representation). However apart from the x-representation, 
other representations are obviously also possible. In this connection, the 


concept of the wave function must be generalized: 


Yalp) = Cal p) (15.4) 


The function ~(f) is the eigenfunction of the quantities of the a-set, 

given in the f-representation. If the values of the -set change discretely, 
Yal P) is the amplitude of the probability that the state ( f | is represented 
in the state (æ |. In the case of continuously changing values of the f-set, 


Yal P) is the amplitude of the given probability density. 


By giving the wave function Ya(f) we give the exact values of the 
quantities in the a-set and probable values of the quantities in the -set. 
Correspondingly, by giving the function pa(y) we give the exact values of 
the a-set and probable for the values of the y-set. It could be said that the 
function Ya(f) describes the state (æ | in the f-representation, while the 
function @_(y) describes the same state, but in the y-representation. The 
fact that different functions ~a(f) and ga(y) are used for describing the 
same state (q | , indicates that there must be some connection between 
them. This connection is expressed through the principle of superposition 


of states. Assuming that y-values change discretely, we can write 
Yalp) = X Palri) xyi(B) (15.5) 


It can be easily seen that (15.5) is the expression for the superposition of 


amplitudes of states: 
(alb) = >) (aly) (r:l) (15.5a) 
If the y-values vary continuously, we get in place of (15.5) 


jalp) = / pal) xy(P) dy (15.6) 


Let us consider the eigenfunction of the quantities of a certain set given 
in the representation of the quantities of the same set. If these quantities 


change discretely, we have, according to (10.8), 
Wa, (aj) = dij (15.7) 
If, however, the quantities vary continuously, we get in place of (15.7) 
Py (a) = 6(a- a’) (15.8) 


where 6(a@ — a’) is the so-called Dirac delta function, which is a general- 


Generalization of the Concept 


of The Wave Function 
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ization of the Kronecker’s symbol for the case of continuously varying 


quantities. 1 


The delta function is determined in the following 


d(a-a’)=0 at a +a 


e (15.9) 
f a-a’)da=1 


Strictly speaking, it is impossible to plot the function ô(« — a’) because 
it would involve drawing an infinitely narrow and infinitely high peak 
at the point a = a’ with a finite “area” under it equal to unity. One of 
the most important properties of the delta function, which can be easily 


derived from its definition (15.9), can be written as follows: 
J f(a) la- a’) da = f(a’) (15.10) 


where f(a) is a bounded function, continuous at the point œ = a’. Assum- 
ing that the a-values are discrete and the /-values vary continuously we 


rewrite (15.6) in the form of 


Yala) = | Ya (B)%p (ai) a (15.11) 
By using (9.33) we can write 


® g (aj) = We, (B) (15.12) 


From (15.12) and (15.7), we obtain from (15.11) the condition for the or- 


thonormalization of the eigenfunctions Ya; (p) 


EZOO = 6 (15.13) 


If the a-quantities vary continuously, we must use (15.8) instead of (15.7). 
In this case the condition of orthonormalization of the eigenfunctions 


assumes the form 


Val B) V2(B) 4B = 5(a— a’) (15.14) 


As an example, we consider the case of a freely moving microparticle. 


For simplicity, we assume that it has zero spin. The wave function in 


coordinate (three-dimensional) representation has the form”? 


Wp, P) = (Po |P) 
i) (15.15) 


3 
= (2h)? a 
(ent) exp [ 7 


where po is the momentum of the microparticle and ř is its spatial coordi- 


14The delta function is discussed in 
detail, for example, in 


Schiff, L. I. (1968). Quantum mechanics. 
McGraw-Hill, N.Y., 3rd edition 


Condition of Orthonormaliza- 


tion of Eigenfunctions 


The Wave Function of a Freely 


Moving Microparticle 


15 We shall derive this result later (see 
Section 20). 
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nate. The function yz, (7) is an eigenfunction of the momentum, given in 
the coordinate representation. It describes the state in which the momen- 
tum components of a microparticle have definite values, while the spatial 


coordinates may be assigned only probable values. 


Switching over from the coordinate to the momentum representation 


and making use of (9.33) we get 


PnP) = (7o|B) = (P| Po)” 
acer 15.16 
= (20h)~? exp (-7£) ) 


The function 9;,(p) is an eigenfunction of the coordinate To of the mi- 


croparticle given in the momentum representation. 


It can be seen from (15.15) and (15.16) that the states of a freely moving 
microparticle are described by wave functions in the form of plane waves 


(in coordinate or momentum space). 


The functions Yz, (F) and 9;,(p) satisfy the condition (15.14) of or- 


thonormalization. This can be verified by using the integral representa- 


tion of the delta function!® 
1 r i 
ôla) = oa i exp(iaf) dp (15.17) 


(the quantities a and £ represent the components of the spatial coordinate 


and the momentum of the microparticle). 


16 Quantum Mechanics as a Qualitative Leap in Man’s Knowl- 


edge of the Laws of Nature 


Of course it is impossible to distinguish sharply between 


natural philosophy and human culture. The physical sciences 


are, in fact, an integral part of our civilization, not only 


because 


has so completely changed the material conditions of life, 


but also because the study of these sciences has contributed 


so much to clarity the background of our own existence. 


N. Bohr 


Although quantum mechanics deals with microparticles, its signifi- 
cance is by no means limited to microphenomena. In our endless quest 
for understanding and perfecting our knowledge of the laws of nature, 
quantum mechanics represents an important qualitative leap. Without a 


comprehension of the importance, and radical (we could even say revo- 


our ever-increasing mastery of the forces of nature 





Figure 15.1: The so called sinc function 
sin x 





x 


161 et us consider the function 
sin(ga) /a@ (Figure 15.1). Irrespec- 
tive of the value of parameter g, 


fo} 


J da sin(ga)/ma = 
crease g. This will bring the points A in 
the figure closertoa@ = 0, and raise the 
point B on the ordinate axis. In the limit 
asg — oo we get an infinitely narrow 
and high peak whose integral is equal 

to unity. This is just the delta function. 
Thus, (œ) = i, sin(ga) /ma. We 


1. Let us in- 


can easily get (15.17) from this since 


sin(ga)/ma = > f exp(iaf) dp. 
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lutionary) nature of this leap, it is impossible to understand the modern 

physical picture of the world. In this section, an attempt has been made to 
look at quantum mechanics from this point of view. This may serve as a 
logical conclusion to the chapter, which has been devoted to the physical 


foundations of this astonishing theory. 


The expression “crazy theory” as one which is “crazy enough to be 
true” was once coined by Bohr. This expression reflects the stunning im- 
pression produced on Bohr’s contemporaries by the astonishing physical 
discoveries made at the beginning of the a0" century, discoveries which 
could not be confined within the framework of classical concepts. It be- 
came obvious that an explanation of these discoveries required radically 


new ideas and a new approach. 


In Section 2 we considered two fundamental ideas of quantum me- 
chanics — the idea at discreteness and the idea of duality. Returning, in 
our imagination, to the beginning of the century, we could call the first 
idea “incomprehensible” and the second, “not properly understood”. The 
introduction of discreteness to the physical picture of the world led to in- 
comprehensible and, apparently, logically controversial quantum “jumps” 
The idea of duality, which asserts the specific nature of microparticles, 
eliminated the contradiction of quantum “jumps” by suggesting a “ma- 
noeuvring” between the “particle” and the “wave” concepts. But the 
meaning of the wave concept introduced here remained in fact unclear for 
a very long time. These two “absurd” ideas led to the emergence of the ex- 
travagant “uncertainty relations” which resulted in a different outlook on 
even such fundamental concepts as “energy”, “momentum” and “angular 


momentum”. 


Quantum mechanics was born under circumstances of a significant 
breaking up of physical traditions. It called for a rejection of many usual 
and accepted notions such as the strict continuity of the spectra of val- 
ues of physical quantities, the trajectory as an essential attribute of the 
motion of an object, Laplace determinism as the basic form of expression 
of the principle of causality, the possibility of an infinite detailization of 
the structure of an object or of a phenomenon with respect to time, the 
possibility of distinguishing between two objects, however similar to each 
other, under any circumstances, the belief that it is always possible, at 
least in principle, to disregard the measuring instrument when conducting 
any measurements, etc. (all these questions have been analysed in detail 


in the preceding sections). 


It is difficult to recall any other period in the history of physics when 
such a serious and large-scale revision of physical concepts was carried 


out. According to Bohr, 


“Crazy Ideas” 
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...the new lesson which has been impressed upon physicists stresses the 
caution with which all usual conventions must be applied as soon as we 
are not concerned with everyday experience ...In the study of atomic 
phenomena we have repeatedly been taught that questions which were 
believed to have received long ago their final answers had most unexpected 


surprises in store for us. 


The revision of the concepts and the rejection of many accepted no- The Essence of Quantum Me- 
tions could well be considered as a “negative aspect” of quantum mechan- chanics 


ics. Let us now consider its “positive aspect”. 


If we try to summarize the main positive knowledge imparted by quan- 
tum mechanics to man in his search for learning about his surroundings, 


the following two important points stand out: 


First: quantum mechanics showed that the basic laws of nature are not 
dynamic but are statistical, and that the probabilistic form of causal- 
ity is the fundamental form while the classical determinism is just 
its limiting (degenerate) case. 

Second: quantum mechanics revealed that probability in nature should 
not be dealt with as in classical statistical theories. It was found that 
in certain cases it is not the probabilities of events that should be 
summed, but rather the amplitudes of these probabilities. This 
leads to the interference of probability amplitudes. 


Thus we emphasize firstly the probabilistic character of the laws of 
nature (the pre-eminence of statistical laws) and, secondly, the special 
relations among the probabilities, which assume not only the summation 
of the latter, but also the specific interference effects. In our view, it is in 
this that the main importance of the information one gets from quantum 


mechanics is to be found. 


As Born pointed out in Physics in my generation,‘ the statistical meth- 17 Born, M. (1956). Physics In My Gen- 
eration: A Selection of Papers. | Pergamon 
Press, London 


ods found wider applicability with the development of physics. As re- 
gards modern physics, it is completely based on statistical foundations. 
In Born’s view, it is the quantum theory that established the closest links 
between statistics and the basic aspects of physics. This should be con- 
sidered as an important event in the history of human knowledge, with 
consequences reaching far beyond the limits of science. It is sometimes 
said that the fundamental difference between quantum mechanics and 
classical mechanics is determined by, the statistical nature of the former 
and dynamic nature of the latter. Upon careful consideration, this appar- 
ently bland and irrefutable statement turns out to be incorrect. While 
revealing the pre-eminence of statistical laws in physics, quantum me- 
chanics shows at the same time that dynamic laws with their unique 


predictions are, as a matter of fact, a special (degenerate) case of prob- 
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ability laws. In this respect not only quantum mechanics, but classical 
mechanics as well, must be, strictly speaking, formulated in the language 
of probabilities!® The qualitative difference between quantum mechanics 
and classical mechanics (or classical physics in general) depends on how 
the relations among probabilities are considered. It has been mentioned in 
Mayakishev the main difference between quantum mechanics and classi- 
cal mechanics does not lie in the statistical nature of the former. It lies in 
the fact that it is not the probability but its amplitude, the wave function, 
that is of primary importance in quantum mechanics. This leads to the 
interference of probabilities, an effect which does not have an analogy in 


classical mechanics. 


Developing the above ideas, let us single out the following points: 


(a) the special interrelations among quantum-mechanical states and 
the resulting specific nature of quantum-mechanical description of 


phenomena; 


(b) the specific nature of the application of probabilities in quantum 


mechanics; 
(c) the special role of interference in quantum mechanics; 


(d) the complementarity principle as a logical foundation of quantum 


mechanics, and 


(e) the dialectical nature of quantum mechanics. 


We shall consider these questions one by one. 


According to Feynman, one of the most outstanding achievements of 
quantum mechanics lies in the fact that it allows so much to be extracted 


from so little. 


The reader has already found out how much can be achieved from the 
phenomenon of interference of amplitudes (see Section 9), on the basis of 
the principle of superposition of states (see Section 10), and from a consid- 
eration of the simplest quantum-mechanical systems, i.e, microparticles 
with two basic states (see Section 13, 14). The relative formal simplicity of 
the description of microphenomena is connected with the specific nature 
of this description. Remember that for a quantum-mechanical description 
we must know firstly the basic states and, secondly, the Hamiltonian ma- 
trix, which reflects the physics of the phenomena under consideration. A 
simplification in the description can be achieved because of the following 


two circumstances. 


Firstly, it is important that the number of basic states, and consequently 


the number of elements of the Hamiltonian matrix required for 


18 This point of view is systematically 
analysed by Myakishev (1973) where 
it is stated, in particular, that Feynman’s 
concepts of path integrals in fact converts 
the principle of least action into the princi- 
ple of maximum probability, i.e, it proves 
that the fundamental dynamic principle is 
essentially statistical in nature. 

Myakishey, G. Y. (1973). Dynamic and 
Statistical Laws in Physics (in Russian). 
Nauka, Moscow 


The Specific Nature of 
Quantum-Mechanical De- 


scription of Phenomena 
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describing a definite phenomenon, should not be large. Thus, in 
the examples given in Section 13, 14, this number was equal to two. 
Here the contradiction regarding the diversity in the possible states 
of the microparticle does not arise, since according to the principle 
of superposition either of them may be represented in the form of 
some superposition of basic states. The principle of superposition 
itself is the deciding factor, which permits us to manage usually 


with a small number of states selected as the basic states. As Dirac 


wrote!’, 19 Dirac, P. A. M. (1958). The Principles 
of Quantum Mechanics. | Clarendon Press 
... in departing from the determinacy of the classical theory a great Oxford 


complication is introduced into the description of Nature, which is 
a highly undesirable feature. This complication is undeniable, but it 
is offset by a great simplification provided by the general principle 


of superposition of states. 


It has been noted earlier (see Section 10) that in classical physics 
all states of a particle should be considered as mutually orthogo- 
nal, or, in other words, as basic states. Because of this, the above- 


mentioned simplifying situation is impossible here in principle. 


Secondly, the relative simplicity of superposition relations allows us to 
draw analogies among microparticles having the same number 
of basic states and reduce all the real problems in practice to a 
consideration of the two-level problem, three-level problem, etc. It 
has been shown in Section 14 how an arbitrary problem with two 
basic states can be reduced formally to the problem of an electron 


in a magnetic field. 


Of course, it cannot be deduced from all this that in general “quantum 
mechanics is simpler than classical mechanics”. Certainly it is simpler 
in the above-mentioned sense. However, it has sufficient problems of its 
own, especially problems connected with a rational choice of the system 
of basic states and with finding the form of the Hamiltonian matrix. It 
is hardly necessary to recount all the difficulties which invariably result 
from the necessity of departing from graphic representations of accus- 
tomed concepts. That is the way things are. Hence it wouldn’t be wise 


1? 


to say that “quantum mechanics is simple!” And yet one must remem- 
ber that the peculiar relations that exist among different states of a mi- 
croparticle and appear in the specific principle of superposition of states 


considerably simplify the quantum-mechanical description of phenomena. 


Quantum mechanics forces us to take a fresh look at the well-known Probability in Quantum Me- 
theorem of addition of probabilities for incompatible events. We have chanics 
to consider not only the incompatibility but also the distinguishability 
of the events. This is where the novelty of the approach lies. It is well 


known that in the probability theory used in classical physics, as well as 
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in engineering, it is always implied that events are distinguishable. 


In order to demonstrate the specific nature of the application of prob- 
ability in quantum mechanics, we make use of the example considered 
in Sections 9 and 10 of the scattering of bosons of the same type by each 


other. We recall the notations introduced in these sections: 
9 (9) =(fils:) «fal S2) 
the probability amplitude of one event (one transition), 
o (x-0) =< folsid {fil sz) 


the probability amplitude of the other event, w — the probability of simul- 
taneous activation of both detectors. Since microparticles of the same 
type are scattered by each other, the question of their distinguishability 
leads to the distinguishability of the initial states ( sı | and ( s2 |. In this 


connection three cases may be isolated: 


First case: The events are completely indistinguishable. 


This means that the initial states are the same, and 
I {si |s2)|=1 (16.1) 
In this case, we get [see (9.17)] 


w=|9(0)+9(1-8)/’ (16.2) 


Second case: The events are partially distinguishable. 


This means that the amplitude ( sı | s2 ) satisfies the condition 
0<(s;|s2)<1 (16.3) 
In this case, we get [see (10.7)] 


w =| 9 (0) + o (2-8) |+ (si 152) I? Co (8) g" (a - 8) + 9°(8) p(x- 0)) 


(16.4) 
Third case: The events are completely distinguishable. 
This means that the amplitude (sı | s2 ) satisfies the condition 
(sıls2)=0 (16.5) 
This gives us [see (9.16)] 
w=] (0)? +lp (2-6) |? (16.6) 


Thus we find that the theorem of the addition of probabilities “holds” only 


in the third of the above-mentioned cases, i.e, in the case of completely 
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distinguishable events. From (16.5) it follows that the states (s;| and (s2| 


must be mutually orthogonal” in this case. In the remaining cases the 20t is worthwhile mentioning here that 
the mutual orthogonality of all states of 

a classical object stipulates a complete 
completely indistinguishable, the amplitudes of the probabilities should distinguishability of events and, as a 
result, leads to the theorem of addition of 
probabilities. 


theorem of the addition of probabilities does not hold. If the events are 


he summed. But if the events are partially distinguishable, we must use 
the more complicated relation (16.4). When deriving this relation, both 
the law of the addition of amplitudes and the theorem of the addition 
of probabilities have been used in the same way as in Section 9 when 


deriving (9.10). 


It can be easily seen that the result (16.4) based on the addition of 
probabilities as well as the-addition of amplitudes is the most general 
one. When the condition (16.5) is satisfied, it at once leads to the “purely” 
classical case of addition of probabilities while condition (16.1) is fulfilled 


“purely” in the case of addition of amplitudes of probabilities. 


It should be emphasized that the very possibility of the existence of 
the general result (16.4) is caused by the presence of superposition bonds 
between the states ( sı | and < sz | [see (10.6) in this connection]. Thus we 
track the active connection between the quantum-mechanical principle of 
superposition of states and the specific nature of the application of prob- 
ability in quantum mechanics. The superposition relations among states 
and the interference of probability amplitudes have the same physical 


nature. 


We draw the reader’s attention to the following fact. In order to ex- Quantum Mechanics and 
plain the interference results in experiments with microparticles (for ex- Interference 
ample, the interference pattern on the detector-screen in Experiment 1 of 
Section 7), we can formally proceed in two different ways. One way cor- 
responds to the “conservation” in quantum mechanics of the theorem of 
the addition of probabilities for any incompatible events. This way, how- 
ever, requires a comparison of the microparticle with some classical wave. 
The other way corresponds to the addition of probability amplitudes. In 
this case an explanation of interference results no longer requires the 


introduction of any visual wave model. 


The specific nature of microparticles, which has been discussed in 
detail in the preceding sections, precludes the first way and so puts the 
question of interference and wave processes in a new light. Before the 
appearance of quantum mechanics, interference was always considered as 
an example of a typical wave effect. If a characteristic interference pattern 
was observed in any experiment, it was considered sufficient evidence to 
draw conclusions on the presence of waves. In this sense, waves were con- 
sidered as being “primary”, and interference as being secondary. Quantum 


mechanics shows that the reverse order of emphasis is more correct. 
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In revealing that the probability laws of nature involve the addition of 
probability amplitudes and not of the probabilities themselves, quantum 
mechanics revealed the fundamental role of interference in physical 
phenomena. Simultaneously, it showed that classical wave processes need 
not essentially be at the root of an interference pattern. In the general case, 
interference is a specifically quantum-mechanical effect associated with the 


addition of probability amplitudes. 


Traditions, however, die hard. This explains attempts to “translate” 
the interference of probability amplitudes into the graphic language of 
classical waves, which inevitably leads to a definite misuse of wave ter- 
minology (see the above interlude “Are these the same waves?”). Ina 
number of cases the “translation” into wave language is not justified even 
from a formal point of view. For example, the division of microparticles 
into fermions and bosons, which is such an important consequence of 
the interference of amplitudes, is fairly difficult to explain on the basis 
of wave processes. The analysis of the process of the destruction of in- 
terference of amplitudes in the measuring process (“the reduction of the 
wave packet”) directly indicates the impropriety of using classical wave 
concepts when considering microphenomena. All this indicates that an ex- 
planation of interference obviously does not come within the framework 


of the traditional wave model. 


By the way, the last fact may be taken as the starting point for a gen- 
eralization of the very concept of “wave process”. Such a generalization 
assumes a transition from visual classical waves with real amplitudes to 
some sort of generalized waves with complex amplitudes. The classical 
waves must appear as an extreme (degenerate) case of such generalized 
waves. In other words, quantum-mechanical interference may be used 
for an extension of the framework of the accepted wave picture (which, 
incidentally, is invariably accompanied by rejection of a graphic represen- 
tation) and for creating a theory for the generalized wave processes which 
would reflect not only the probability nature of physical laws but also the 


special relations among probabilities in nature. 


By demonstrating the fundamental nature of the phenomenon of 
interference, quantum mechanics naturally arises an interest in the study 
of this phenomenon in different branches of physics. In our view, it raises 
definite hopes that modern physics, stimulated by the effect of interference, 
will develop in future into a study of the interference of effects in the fields 


of both microphenomena and macrophenomena. 


The idea of “addition” (summation, accumulation) of different phenom- 
ena is familiar to us. in a way, this can be compared with the “addition of 
probabilities.” It is likely that quantum mechanics tells us (indicates in its 


own way) that such a picture is the result of a certain “averaging”, approx- 
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imation or simplification of a finer and better picture in which we “add 
up” not the effects themselves, but something different (corresponding to 
probability amplitudes in the language of quantum mechanics), thereby 


arriving at the phenomenon of interference of effects. 


A Brief Interlude 


Fairy tales, though oft untrue, 
Teach good lads a thing or two. 
Pushkin 


READER: It is not clear what you wanted to say in the last sentences 
which, though quite eloquent, are tentative. Please explain them, if 


possible. 


AUTHOR: Gladly. Let us take a definite example. It is well known 
that if a substance is placed in a condenser, its optical properties will 
change under the influence of the external electric field. Such effects 
are called electro-optical effects. If the substance is subjected to a 
light field of sufficient intensity (generated, say, by a high-power 
laser), the optical properties change in this case also. These effects 
are called nonlinear optical effects. It so happens that if a condenser 
field and an intense light field are simultaneously switched on, then 
in addition to the familiar electro-optical and nonlinear optical 
effects we observe qualitatively different effects which may be 
explained only as a unique interference of electro-optics and non- 


linear optics. This is an example of the interference of effects. 


READER: But can one perceive in this example any tendency towards 


the development of modern physics? 


AUTHOR: Let us take another example, that of a laser. We shall not 
discuss the principle of its working here; it is just sufficient to men- 
tion that it is based on some nonlinear effect, called the saturation 
effect. Let us take another instrument, the second-harmonic gener- 
ator (that is what a transformer of coherent light which doubles 
the frequency is called in quantum mechanics). We shall simply 
state that this instrument also is based on the principle of nonlin- 
ear optical effect called the generation of second harmonics. Thus, 
the laser produces coherent light of a definite frequency, while the 
second-harmonic generator partially transforms the frequency of 
this light. We can say that we first use the saturation effect and then 
the effect of second-harmonic generation. Such is a general situa- 
tion corresponding to a simple “summing” of these effects. Now, 


suppose that both these effects are used simultaneously. In order 
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to do so, we must place a special crystal, which causes a trans for- 
mation of the frequency of light passing through it, inside the laser 
(or, more precisely, inside the resonator of the laser). Here, we are 
speaking of a qualitatively different situation corresponding to the 
interference of two nonlinear optical effects. It is quite significant that 
this and other similar situations have been increasingly drawing the 
attention of specialists engaged in the field of quantum electronics. 
Intraresonator generation of second harmonic is already being put 
into practice; it has been proved that it can ensure a more effective 


transformation of optical frequency. 


READER: This really sounds interesting. It is possible that some ten- 
dency is indicated here. But what has this got to do with quantum 


mechanics? 


AUTHOR: When studying reality on a fundamental level, quantum 
mechanics highlighted some important points. It showed that the 
question of interference is deeper than it was considered, and that 
this question can be posed independently of wave questions; and 
finally, that interference is an example of qualitatively new inter- 
relations, i.e. relations which obviously have more prospects than 
the traditional interrelations corresponding to a simple summing, 


adding, or accumulation. 


The dialectical nature of quantum mechanics is reflected in its very 
initial principles. In this connection the principle of complementarity, put 
forth by Bohr, is of special interest. This principle forms in fact the logical 


foundation of the entire system of quantum-mechanical ideas. 


The essence of the principle of complementarity lies in the following: 
it is stated that in any experiment with microparticles, the observer gets 
information not about the “properties of the particles themselves” but 
about the properties of the particles associated with some particular 
situation including, among other things, the measuring instruments. The 
information about the object obtained under some definite conditions 
should be considered as complementary to the information obtained 
under different conditions. Essentially, the information obtained under 
different circumstances cannot he added, accumulated or combined into 
a single picture; it reflects various sides (complementing one another) of 
a single reality, to wit the object under investigation. The principle of 
complementarity finds a direct expression, in particular, in the idea of 


wave-particle duality and in the uncertainty relations. 
According to Bohr, the term ‘complementarity’ is used 


in order to stress that in the contrasting phenomena we have to do with 


Complementary Principle 


PHYSICAL FOUNDATIONS OF QUANTUM MECHANICS 149 


equally essential aspects of all well-defined knowledge about the object. 21 


...In atomic physics the word ‘complementarity’ is used to characterize 
the relationship between experiences obtained by different experimental 


arrangements and visualizable only by mutually exclusive ideas ee 


Evidence obtained under different experimental conditions cannot be 
comprehended within a single picture, but must be regarded as comple- 


mentary ao 


In quantum physics, however, evidence about atomic objects obtained 
by different experimental arrangements exhibits a novel kind of comple- 
mentary relationship. Indeed, it must be recognized that such evidence 
which appears contradictory when combination into a single picture is 


attempted, exhausts all conceivable knowledge about the object.74 


We advise the reader to carefully read the words of Bohr once again. 
Thus, data about a microparticle may be “graphically interpreted” only on 
the basis of “ideas mutually excluding one another” In this sense they can- 
not be added in a simple way, and “cannot be contained in a single picture”. 
Various data have “peculiar” (the reader must not let this epithet go unno- 
ticed) relations with one another, hence the term “complementarity”. The 
peculiarity of the “complementarity” relations lies in the fact that data 
“complementary” to one another may be obtained only “under different 


experimental conditions? 


The specific nature of quantum-mechanical ideas, emphasized fre- 
quently in the foregoing, and their somewhat unusual logic rests to a 
considerable extent on the principle of complementarity. A microparticle 
is neither a corpuscle, nor a wave, but still we employ both these images, 
which mutually exclude each other, for describing a micro-particle. Just 
imagine the situation: the corpuscle and wave pictures are used for de- 
scribing an object which is neither a corpuscle, nor a wave, nor even a 
symbiosis of them! Naturally, this could give rise to a ticklish question: 
Doesn’t this mean an alienation of the image from the object, which is 
fraught with a transition to the position of subjectivism? A negative 
answer to this question is given by the principle of complementarity it- 
self. From the position of this principle, pictures mutually excluding one 
another are used as mutually complementary pictures, adequately rep- 
resenting various sides of the objective reality called the microparticle. 


According to Bohr, 


this point is of great logical consequence, since it is only the circumstance 
that we are presented with a choice of either tracing the path of a particle 
or observing interference effects, which allows us to escape from the 
paradoxical necessity of concluding that the behaviour of an electron or 


a proton should depend on the presence of a slit in the diaphragm through 


21y. Bohr, “On the Notions of Causality 
and Complementarity” 


22N, Bohr, “Natural Philosophy and 
Human Cultures” 


23N. Bohr, “Discussion with Einstein 
on Epistemological Problems in Atomic 
Physics” 


24N Bohr, “Quantum Physics and 
Philosophy”. 
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which it could be proved not to pass. 


It is true that dialectical nature is inherent in every physical science 
to some extent. Still it may be stated that classical physics, because of 
the very style of its philosophy (unambiguous predictions in theories of 
dynamic type, the approach to any object as a “combination” of certain 
“details”, and to any phenomenon as a succession of certain elementary 
events, etc.) is drawn towards metaphysics. In this sense the significance 
of quantum mechanics cannot be overestimated. It has convincingly 
shown that a higher level of knowledge of the laws of nature is inevitably 
linked with a deeper and more serious knowledge and application of the 


methods of materialistic dialectics. 


In considering the cases in which the dialectical character of quan- 
tum mechanics is especially manifested, we single out two points which 
appear most important to us. These are the statement of relations of di- 


alectical type and the application of the categories of dialectics. 
Statement of Relations of Dialectical Type. 


Simple accumulation, or summation of data, properties and concepts 
are characteristic (we shall say that summation relations are characteris- 
tic) of a metaphysical method. These relations to a considerable extent 
form the logical foundations of classical physics. Quantum mechanics lays 
emphasis on relations of a qualitatively different, dialectical type, like the 
relations of complementarity and relations of interference. Thus, it shows 
that data about an object, strictly speaking, do not simply add up but com- 
plement one another, that the probabilities of different events, strictly 
speaking, are not summed, but interfere with one another. We have dis- 
cussed these specific relations above when considering the principle of 
complementarity and the specific nature of the application of probability 
in quantum mechanics. When evaluating complementarity and interfer- 
ence as new relations, or new interrelations corresponding to a higher 
level of knowledge of the laws of nature, it can’t be denied that quantum 
mechanics really determines the direction for the development of modern 


physics. 
Application of the Categories of Materialistic Dialectics. 


In classical theories of dynamic type the concepts of necessity and 
chance, possibility and actuality do not appear as dialectical categories. 
Necessity here is absolutely (metaphysically) the opposite of chance. The 
latter is simply banished from the theory, which immediately leads to an 
identification of the concepts of the possibility and actuality. As dialectical 
categories, relations among which is unity and struggle of opposites, these 


concepts appear in statistical theories, primarily in quantum mechanics. 


The Dialectical Nature of 


Quantum Mechanics 
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It is essential that in quantum mechanics the dialectical categories of ne- 
cessity and chance, possib- ility and actuality are applied not only to the 
ensembles of objects but also to an individual microparticle. The applica- 
tion of dialectical categories in quantum mechanics has been considered 
throughout this book when discussing certain questions. We remind the 
reader that an analysis of the problem of causality in quantum mechanics 
could be accomplished only on the basis of the dialectical categories of 
necessity and chance. An explanation of the quantum-mechanical super- 
position of states and its destruction in the process of measurement was 
made possible only by using the dialectical categories of possibility and 


actuality. 


In conclusion, we: note that quantum mechanics has clearly demon- 
strated the dialectical struggle between form and content. In Bohr’s 


words, 


the lesson we have hereby received would seem to have brought us a 
decisive step further in the never ending struggle for harmony between 
content and form, and taught us once again that no content can be grasped 
without a formal frame and that any form, however useful it has hitherto 


proved, may be found to be too narrow to comprehend new experience. 
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Interlude: Do Quantum-Mechanical Concepts Contradict Our 


Common Sense? 


Common sense is that layer of prejudice 
we acquire before we are 16. 


A. Einstein 


In the balance between seriousness and humour, 
characteristic of all truly artistic achievements, we 
are reminded of complementary aspects conspicuous in 
children's play and no less appreciated in mature life. 


N. Bohr 


In this unusual discussion the 
author is joined by two physi- 


COOPER: There have been bitter complaints from some of our con- cists (Bohr and Cooper) and 


temporaries, that physics in the twentieth century has become two literateurs (Dobrolyubov 


too abstract, has lost touch with those things ordinary people can and Perrault). 


understand, has lost contact with common sense and substituted in- 25Excerpts from the following works 


have been used in the discussion: (Bohr, 
1958b, 1955; Dobrolyubov, 1948; Cooper, 
them.”° 1968; Perrault, 1946). 

Bohr, N. (1958b). Natural philosophy 
and human cultures. In Atomic Physics 


stead constructs so abstract that the ordinary mind can never attain 


AUTHOR: That is why I want to talk about quantum mechanics 
and Human Knowledge, pages 23-31. New 


and “common sense”. I have many times heard the complaint that 
quantum mechanics is hard to follow because its concepts are in 
“contradiction to common sense”. Unfortunately nobody knows 
precisely the meaning of “common sense”. If I am not mistaken, you 
are of the view that “common sense” is a relative concept, and that 


its meaning changes significantly with the development in science. 


COOPER: Yes, it is a cliché that the commonsense of the new genera- 
tion is formed from concepts laboriously constructed by their elders, 
that what is avant-garde for one generation is common sense and 
prosaic for the next. It seems dubious that the Newtonian concep- 
tion of the world would have been common sense to the Greeks in 
the time of Aristotle, for that matter to the scholastic scholars. It 
was not even so for many of his contemporaries. And those so en- 
amored of common sense (at present Newton’s world) are often just 
those who complain that the mechanical Newtonian view destroyed 


the magical medieval world. 


AUTHOR: It is difficult to counter such a point of view, By the way, it 
immediately solves the question posed by us at the very beginning. 


Isn’t it? 


COOPER: I think so, for the present generation of physicists it is 


quantum physics that is common sense. It is the structure of the 


York, Wiley; Bohr, N. (1955). Science 

and the unity of knowledge. In Leary, 

L., editor, The Unity of Knowledge, pages 
47-62. Doubleday; Dobrolyubov, N. A. 
(1948). What is oblomovshchina? Literary 
Criticism, 2; Cooper, L. N. (1968). An 
Introduction to the Meaning and Structure 
of Physics. Harper and Row, N.Y; and 
Perrault, C. (1946). The Sunshine Book 
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quantum theory that is closest to them, relations of the quantum 


theory that have an immediacy and an intuitive correctness ... 


AUTHOR: However, without denying the justification of the point 
of view mentioned above, let us try to solve the question of com- 
mon sense and look at this question from a more open position. 
Moreover, we would like to speak of the commonsense not of physi- 
cists, but of “ordinary people”, those who “bitterly complain” that 
the “ordinary mind can never attain” the ideas of modern physics. 
An approach from a more open position to the “common sense” in 
quantum mechanics was strongly advocated by one of our honoured 
participants. It would be nice to hear in this connection his remarks 


on the complementarity principle, for example. 


BOHR: Using the word “complementarity” much as it is used in 
atomic physics, to characterise the relationship between experiences 
obtained by different experimental arrangements, and visualizable 
only by mutually exclusive ideas, we may truly say that different 
human cultures are complementary to each other. Indeed, each 
such culture represents a harmonious balance of traditional con- 
ventions by means of which latent potentialities of human life can 
unfold themselves in a way which reveals to us new aspects of its 


unlimited richness and variety. 


AUTHOR: What is there that an ordinary person cannot comprehend? 
And yet the complementarity principle finally boils down to uncer- 
tainty relations as well as wave-particle duality of microparticles. 
Meanwhile, all these incomprehensible ideas find their due place 
not only in questions linked with human cultures but even in quite 


common questions, for example questions linked with psychology. 


BOHR: We all know the old saying that, if we try to analyse our own 
emotions, we hardly possess them any longer, and in that sense 
we recognise between physical experiences, for the descriptions of 
which words such as “thoughts” and “feelings” are adequately used, 
a complementary relationship similar to that between the experi- 
ences regarding the behaviour of atoms obtained under different 


experimental arrangements. 


AUTHOR: But can’t we go one step further, and try to draw some 
analogies between modern ideas in physics and ideas contained in 
famous literary works? It would be quite interesting to compare 


poetical truth and scientific truth. 


BOHR: In other words, is there a poetical or spiritual or cultural truth 


distinct from scientific truth? 


PHYSICAL FOUNDATIONS OF QUANTUM MECHANICS 


AUTHOR: Precisely. 


BOHR: We are indeed directly confronted with the relationship of 
science and art. The enrichment which art can give us originates in 
its power to remind us of harmonies beyond the grasp of systematic 


analysis. 


AUTHOR: I am extremely grateful to you. You have helped dispel 
some of my doubts. I would now turn to particular examples, the 
first example being Oblomov, a novel by I. A. Goncharov, Let us 
recall what one of our distinguished participants has written about 


the image of Oblomov. 


DOBROLUYBOV: Well, take for example this: Oblomov’s wishes 
always assume the form “how good it would be if this were done”, 
but how this can be done he does not know. That is why he is so 
fond of dreaming and dreads the moment when his dreams may 


come in contact with reality. 


AUTHOR: I remember that you drew analogy between Oblomov and 


such literary heroes as Onegin, Pechorin, Rudin. 


DOBROLUYBOV: Yes, indeed. But the point is that they all have one 
common feature-a fruitless striving for activity, the consciousness 


that they could do a great deal but will do nothing. 
AUTHOR: (to BOHR) And what would you say to this? 


BOHR: I would say that the relation between the experience of a 
feeling of volition and conscious pondering on motives for action is 


especially instructive. 


AUTHOR: I should state that the ideas expressed by my learned 
friends pertain to different situations and were profounded at dif- 
ferent times. And yet they do have something in common. To a cer- 
tain extent the personal tragedy of Oblomov lies in his inability to 
overcome the dialectical contradiction between the possible and the 
actual. Potentially, much is vested in Oblomov; this is an evidence 
of the considerable superposition of possibilities. But the “act of 
measurement”, which would allow Oblomov to realize some aspect 
of his nature at the cost of this superposition is missing. This act 
does not occur, the superposition is not destroyed and everything 


remains as it was. 


Let us now consider the second literary example, the famous fairy- 
tale “Cinderella” by Charles Perrault. Will you please narrate the 
scene in which the fairy godmother sees Cinderella off to the ball at 
King’s palace. 
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PERRAULT: Here you are: The fairy godmother then said to Cin- 
derella, “Here is a coach fit for the ball, and coachman and footmen.” 
As Cinderella stepped into the carriage, her fairy godmother said, 
“Remember, you must not stay a minute after twelve, for if you do, 
your coach will become a pumpkin again, the horses will turn back 
into mice; the footmen will become lizards, and the coachman will 


become a rat; and your dress will turn to rags.” 


AUTHOR: Thank you. I wanted to draw your attention to the fact that 
the omnipotent fairy godmother gave the coach and the dress to 
Cinderella only for a while, until midnight. And why not for good? 
It is clear that she could have done that but it would be against the 
inherent logic and the central idea of the tale. It would, so to say, 
take away the “charm”. If it is for a while, well and good; but if it 
is for ever, there is no charm. Doesn’t it remind us of a model of 
virtual transitions? The conservation laws are violated, treasures are 
created out of “nothing” with the help of a magic wand, but all this 
is allowed only for a finite interval of time — “until midnight.” After 
this, Cinderella must return to her previous state, and without the 
beautiful dress. Just compare: a quantum system visits a new level 
without any expenditure of energy from outside, only for a limited 
interval of time after which the system must return to its previous 
level. In the same way, Cinderella performs “virtual transitions” 
between her house and the royal palace, enjoys and dances but is 


careful not to overstep the agreed time limit. 


Then a messenger with the crystal slipper appears on the scene. 


Please narrate what happened after this. 


PERRAULT: The messenger, who had been sent with the slipper, 
said that everyone was to try it. He looked at Cinderella and saw 
that she was beautiful. He ordered her to sit down and put the 
slipper on It fitted her perfectly ... Just then the fairy godmother ap- 
peared and touched Cinderella with her wand and her rags became 


a dress more beautiful than any she had yet worn. 


AUTHOR: And so, it came true. The fairy godmother gave Cinderella 
the dress to keep for ever. The virtual transition led to a real tran- 
sition by Cinderella to “a new level”. The prince, the slipper, the 
messenger - all played the role of the photon which, by interacting 
with the system undergoing virtual transitions, has led to a real tran- 
sition. Of course, the Cinderella story should not be seriously taken 
as an illustration of the idea of virtual transitions, as an explanation 
of quantum jumps. In the same way the novel Oblomov should not 
be seriously taken as an illustration of the principle of superposition 


of states or as an explanation of the problem of destruction of super- 
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position in the act of measurement. However, it is easy to see the 


general internal logic in the comparison drawn above. 


BOHR: It may not be irreverent to remark that even at the climax of 
his work the artist relies on this common human foundation on 


which we stand. 


AUTHOR: With this interesting discussion we come to the end of 
our consideration of the basic concepts of quantum mechanics. Of 
course, quantum-mechanical ideas are quite unusual and peculiar. 
And yet they did not appear from nothing but rather arose on the 
basis of a solid foundation of concepts and ideas which are a re- 
sult of the entire human experience. Hence the analogies between 
physical models and literary images are not incidental, but rather 


inevitable. 
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LINEAR OPERATORS IN QUANTUM MECHANICS 


Every physical theory is a synthesis of certain physical ideas (advanced 


on the basis of experiment) and a certain mathematical apparatus. The 
building-up of a theory is a complicated and controversial process devel- 
oping according to successive approximations. However, in this contro- 
versial process, at least in its initial stage, a completely definite logical 


structure is envisaged, including three logically successive stages: 


1. a stage in which the basic ideas are formulated and interpreted, 


and the physical foundation of the theory is built; 


2. astage in which an adequate mathematical aparatus for the phys- 
ical ideas is found and the physical ideas and the mathematical 
apparatus are linked together, i.e. it is postulated exactly what 


physical meaning corresponds to the various mathematical sym- 


bols (as a result, the mathematical relations acquire the meaning of 


physical laws), and 


3. a stage in which the “physicized” mathematical apparatus is “set to 
work” The new results thus obtained are checked experimentally 
where this is possible. This leads to a further understanding of 
the physical content of theory and to a further development of its 


apparatus. 


At the time of creation of a theory the mathematical apparatus ade- 
quate for the description of physical ideas mayor may not be available. 
When Newton created his mechanics, he also had to work out the corre- 
sponding mathematical apparatus — the method of fluxions which later 
developed into differential and integral calculus. But when quantum me- 
chanics was created, the appropriate mathematical apparatus was already 


available in the form of theory of linear operators. 
In Quantum Physics and Philosophy', Bohr writes 


...in quantal formalism, the quantities by which the state of a physical sys- 
tem is ordinarily defined are replaced by symbolic operators subjected to a 
non-commutative algorism involving Planck’s constant. This procedure pre- 
vents a fixation of such quantities to the extent which would be required 
tor the deterministic description of classical physics, but allows us to de- 
termine their spectral distribution as revealed by evidence about atomic 
processes. In conformity with the non-pictorial character of the formal- 
ism, its physical interpretation finds expression in laws, of an essentially 


statistical type... 


Turning to the mathematical aspect of quantum theory, we shall con- 
sider below how quantum-mechanical ideas “are introduced into” the 
apparatus of linear operators. We shall demonstrate the working of this 


apparatus by a number of specially selected examples and problems. 
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1 Bohr, N. (1963). Essays 1958-1962 on 


atomic physics and human knowledge. 
terscience, N.Y., Interscience, NY 
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17 A Brief Look at the Theory of Linear Operators 


This section is essentially mathematical in nature. It includes different 
questions on the theory of linear operators. In other words, it describes 
the fundamentals of the mathematical apparatus which proved to be 
suitable for the creation of quantum theory. We emphasize that in this 
section none of the mathematical symbols is “loaded” with any physical 


meaning. 


An operator, when applied to some function, transforms it into a new 


function. The notation 
Ly (x) = (x) (17.1) 


means that the operator denoted the symbol L acts on the function y (x), 


as a result of which we get function ọ (x). 
The operator L is called linear if it satisfies the conditions: 


L(+) =Ly+Llyp 


E R (17.2) 
L (ay) =aLy 


where a is some number. We shall be using only linear operators below. 
The effect of an operator acting on a function may be represented as a 


definite or an improper integral: 


kie i L (x,y) Y (y) dy (17.3) 


The quantity L (x, y) is called the kernel of the operator. If the variable is 


discrete, we will have instead of (17.3) 
i Yn = E Lam Ym (17.4) 


The totality of coefficients Lym forms the matrix of the operator L, and we 
speak of the matrix representation of the operator. A matrix representa- 
tion is always possible because the kernel L(x, y) in (17.3) may obviously 


be treated as a continuous matrix. 


Suppose Ê y = 9; the operator L* is called the complex conjugate of the 
operator L, if by the action of this operator on the function * we get the 
function ¢”: 


L* y" (x) = g" (x) (17.5) 


The operator L is said to be the transpose of the operator L if the following 


condition is satisfied: 


f rwt ax= f yw iwo) dx (17.6) 


Linear Operators (Basic Defini- 


tions) 
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The kernel of the transposed operator satisfies the condition 


L (x,y) = L (y, x) (17.7) 
while the matrix satisfies the condition 
Dnm = Lam (17.8) 


Let us consider some linear operator L. Let us find its complex conjugate 
operator L*. We then find the transposed operator È for the operator Î*. 
This operator is denoted by L'? and is said to be conjugate to the operator 
L. By using the concept of the conjugate operator, two important types of 


linear operators are defined: Hermitian operators and unitary operators. 


If 


i=) (17.9) 
the operator L is called Hermitian (self-adjoint). If 


fE =f i= (17.10) 
the operator L is said to be a unitary operator. Note that unitary operators 
are usually denoted by the symbol Û. The matrix of a unitary operator 


satisfies the condition 
> Cal Sou (17.11) 
m 


Note an important property of unitary operators. Suppose 


` Unm Ym = Pn 
m 


It can be easily seen that 


S am > Vin Pm (17.12) 


The basic equation of the theory of linear operators has the form 
Ly=Ay (17.13) 


The numbers A for which the equation (17.13) has finite solutions, form 
the spectrum of eigenvalues of the operator L. The spectrum of eigenval- 
ues of an operator may be continuous, discrete or mixed. The solutions 
y(x) of equation (17.13) are called the eigenfunctions of the operator Ê. 
One or more eigenfunctions may correspond to a given eigenvalue. If s 
linearly independent eigenfunctions correspond to a certain value , it is 


said that the eigenvalue 4; is s-fold degenerate. 


We shall state three theorems concerning the basic Properties of Her- 
mitian Operators properties of Hermitian operators. (The proofs of the 


first two theorems are given in Appendix A.) 


2In the original typesetting symbol + is 
used to denote the conjugate, here we will 
be using + symbol. 


Properties of Hermitian Opera- 


tors 
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First Theorem 


An operator has real eigenvalues if and only if it is Hermitian. 


Second Theorem 


The eigenfunctions of a Hermitian operator, corresponding to 


different eigenvalues, are mutually orthogonal. 


Let L Wn = An Wn. The theorem states that for A, # Am, 


J Vin (x) Wn(x) dx = 0 (17.14) 


Since (17.13) is homogeneous, the eigenfunctions are determined 
upto an arbitrary constant multiplier. We choose this multiplier so 


that the normalization condition is satisfied: 


J Win (x) Yn(x) dx =1 (17.15) 


By combining (17.14) and (17.15), we get the condition of orthonor- 


malization of the eigenfunctions of a Hermitian operator: 


J Vin (X) Yn (x) dx = nm (17.16) 


If the spectrum of the operator is continuous, we get instead of 
(17.16) 


J C0 ta) de = 5-2 Kae 


Let us consider the case of the s-fold degeneracy of an eigenvalue. 
The eigenfunctions corresponding to this eigenvalue are, generally 
speaking, not orthonormalized. However, we can form s linear 
combinations of the given functions which satisfy the condition of 


orthonormalization. 


Third Theorem 


Any bounded function may be expanded in a series (integral) of 
eigenfunctions of a Hermitian operator. In other words, the system 
of eigenfunctions of a Hermitian operator is a closed (complete) 


one. 


Using the last theorem, we represent any particular function ®(x) 


in the form of a series of eigenfunctions 
Yn (x): B=)" en tn 
n 


In order to find c, we multiply this equation by y}, (x) and inte- 


grate with respect to x: 


J Yk) & (2) d= Yen fn (0) Yul da 
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Using (17.16), we get from this 
i Win (2) ® (20) dx = Y Cn Sm = Cm 
Thus, 
® (x) = $ cn Pal) 
where (17.18) 


Ch = 7 yh (x) B(x) dx 


In the case of a continuous spectrum, we must use the condition 


(17.17). As a result, we get instead of (17.18) 


w(x) = f ehd 
where (17.19) 


c(A) = J yi (x) (x) dx 


We shall mention one of the results which is a direct consequence 
of the completeness of the system of eigenfunctions of a Hermitian 


operator. Suppose 


(x) = [eta Yale) a 


we) =f B02") var) 


By using (17.17), it can be easily seen that 


I W* (x) B(x) dx = J b*(A) c(A) da (17.20) 


Suppose that the Hermitian operator M transforms the function ®(x) 
into the function ¥(x) : 
W(x) = M ®(x) (17.21) 


Let us expand the functions ® and ¥ in terms of eigenfunctions y}(x) of 
another Hermitian operator (the operator L). Assuming that the spectrum 


of the operator L is continuous, we have 


(x) = f ela) Yale) da 
(17.22) 


wx) = fb) ya aa 


The transformation (17.21) will then have some corresponding transforma- 
tion of the function c(A) to the function b(A). We write this transforma- 
tion in the form 


b(A) = M(A) c (å) (17.23) 


Representations 
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It is said that (17.21) and (17.23) are two different representations of the 
same transformation. The nature of the representation is determined by 
the variables on which the initial and final functions depend. Hence we 
speak of x-representation in the case of (17.21) and of A-representation 
(representation of the operator L) in the case of (17.23). Correspondingly, 
the operator M in (17.21) is the operator of the given transformation, 
defined in the x-representation [for convenience, we shall henceforth 
write it as M(x)], while the operator M(A) occurring in (17.23) is the 


operator of the given transformation, defined in A-representation. 


Let us see what the Hermitian operator looks like in eigen representa- 
tion. Let g, (x) be the eigenfunctions of the operator M. In this case, we 


can write (17.22) in the following form: 


B(x) = 3 c(i) p(x) dy 


(17.24) 
Yæ = f op) ay 
Applying the operator if M(x) to the function ®(x), we get 
MAD = f oly) Moga) dy 
= f wel) oul) dy 
Comparing this result with the second equality in (17.24), we have 
b(u) = uc(p) (17.25) 
whence we get 
M(p) = (17.26) 


Thus in its eigen representation, the Hermitian operator coincides with its 


eigenvalues. 


We shall now consider a more general situation: the operator M(x) 
is known, it is required to find out the form of this operator in the A- 


representation. Using (17.21) and (17.22) we write 
W(x) = M(x) ®(x) 
= f hO 
m / b(A’) yy (x) da! 


Multiplying both sides of the last of these equalities by y} (x) and integrat- 


ing with respect to x, we get 


fe | J 569 8109 Yuen ax| aa’ = foo) | [Hic vee ax| di’ 
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By using (17.17), we find that 


ia J | / POLOA ax| o(a’) da! 


Lastly, by taking (17.23) into account, we finally get the result [compare 
with (17.3)]: 
M(A) c(A) = ‘| M(A, a’) c(4’) da’ 


where 


M(A, a’) = / W% (<) M(x) Yar (x) dx (17.27) 


Thus, if the form of the operator is known, say, in the x-representation, 
we can find the matrix of this operator in the A-representation by using 
the eigenfunctions of the operator L given in the x-representation, by 


(17.27). 


In conclusion, we mention a situation where the eigenfunctions of one 
Hermitian operator are given in the representation of another operator. In 


this case, the following relation is valid: 


yal) = pA) (17.28) 


A transition from the A-representation to the x-representation is deter- 


mined with the help of (17.22). In operator form, these relations can be 


written as 

(x) = U(x, A) c(A) 

(17.29) 

W(x) = U(x, A) b(A) 

Next, we put 
/ W* (x) (x) dx = J W* (x) U(x, A) e(A) dx 
Z i c(A) [UT(A, x) W(x) ]" da 

Using (17.20), we get 

b(A) = UT (A, x) (x) 
Two important results follow from the last formula. Firstly, 

W(x) = U(x, A) b(A) 
= U(x, A) U1 (A, x) ¥(x) 

and, consequently, 

U(x,A) ÛÔÌ (A,x) =1 (17.30) 


This means that the transition from one representation to another is 


accomplished with the help of a unitary transformation. Secondly, if 


Transition from One Represen- 
tation to Another as Unitary 


Transformation 
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W(x) = M(x) ®(x), then 
U(x, A) b(A) = M(x) U(x, A) c(A) 
and, according to (17.30), 
b(A) = ÔÏ (A, x) M(x) U(x, A) c(A) (17.31) 
Comparing (17.31) and (17.23), we get 
M(A) = 0" (A, x) M(x) U(x, A) (17.32) 


It should be explained here that a product of operators means the succes- 
sive action of the operators on the functions to their right. In the present 
case we should first apply the operator U(x, A) to the function c(A), thus 
getting the function ®(x). Operator M(x) then acts on the function ®(x), 
resulting in the function ¥(x). Finally, the function V(x) is acted on by 
the operator Ut (A,x). 


Thus, the relations (17.29) describe the transition from one representa- 
tion of functions to another carried out with the help of a unitary opera- 


tor. Relation (17.32) describes the same transition for operators. 


Quantities and properties which do not change with unitary transfor- 
mations and are, consequently, independent of the choice of representa- 


tion are called unitary inuariants. Unitary invariants include: 


(a) the Hermitian property of an operator (if an operator is Hermitian 
in one representation, it will he Hermitian in any other representa- 


tion); 


(b 


we 


the spectrum of eigenvalues of a Hermitian operator; 


(c) the condition of orthonormalization of eigenfunctions [this is 


immediately seen from (17.12)]; 


(d) integrals of the type J W* (x) (x) dx [this directly follows from 
(17.20)]; 


~ 
oO 
<~ 


integrals of the type J W* M(x) ®(x) dx and of the more general 


type / W* M"(x) (x) dx, where n is a positive integer. 


Note that unitary invariance of these integrals means that the follow- 


ing relations hold: 
J W* M(x) ®(x) dx = J b* (à) M(A) c(A) da (17.33) 
/ W* M"(x) (x) dx = i b* (A) M” (A) c(A) da (17.34) 


Unitary Invariants 
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As an example we shall prove (17.33). Using (17.29) and (17.32), we succes- 
sively transform the left-hand side of (17.33): 


J ratna FG e(aydx = f [U* b*(A)| U M(A) c(A) dx 
= J b*(A) Ut U M(A) c(A) dà 
= J b* (A) M(A) c(A) da 

Thus, as a result, we get the right-hand side of (17.33), Q.E.D. 


Two operators Ê and M are called commutative if for any bounded 


function ®(x) satisfy the following condition: 
ML®(x) = LM@(x) (17.35) 


If there is even one function for which (17.35) does not hold, the operators 
L and M are called non-commutative. The notation 

[M, f] = MÊ- ÊM 
is called the commutator operator of Ê and M. If the operators commute, 
this fact is often expressed thus: 


[M, £] =0 


The following theorem holds: If the operators Ê and M have common 
eigenfunctions forming a closed system, the operators commute. We shall 


prove this theorem. 


We denote the eigenfunctions of the operators Land M by pry (x). 
(The double subscript Ay indicates the fact that these functions are com- 
mon). Obviously, 

ML Yap =AM ry 
=À H Yap 

LM Yap = pÊ Yap 
= pÀ Yap 


It follows from this that 
(ML-LIM) ty, =0 


and since the functions ¥,,(x) are known to form a closed system, for any 


function ®(x) we can write 
(ME EM) ®(x) = X cay (ME - EM) pay (x) 


=0 QED. 


Commutation of Operators 
and the System of Common 


Eigenfunctions 
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We stress the importance of the fact that common eigenfunctions must 
form a closed system. It may so happen that two operators have only 
one common eigenfunction. In this case it is impossible to draw any 


conclusion about the commutativity of the operators. 


The above theorem means that commutativity of operators is a neces- 
sary property of the commonness of the system of eigenfunctions of these 
operators. But is this property a sufficient condition also? The answer to 
this question is in the affirmative in the case when there is no degeneracy 
in the eigenvalues of the operators. In the more general case, taking into 
account the possibility of degeneracy, the following theorem is valid. If the 
operators L and M commute, we can find their common eigenfunctions. 
For example, suppose the eigenvalue 4; is s-fold degenerate. From the s 
solutions of the equation L y = A, y, we can choose s linear combinations 
which are also the eigenfunctions of the operator M. In this case, A will 


in general correspond to s different eigenvalues of the operator M. 


With this we conclude our degression in the field of “pure” mathemat- 
ics. The information contained here is sufficient for further use of the 


apparatus of linear operators without referring to special literature. 


18 From Hamiltonian Matrix to Energy Operator 


We turn to equation (12.8) discussed in Section 12. There we chose a The Influence of the Choice of 
certain system of basic states of microparticle {( i |}. An arbitrary state Basic States 
( s(t)| of this microparticle at time t was represented as a superposition of 


these basic states: 


(sl = S (5) 10) (il (18.1) 


The notation C;(t) was employed for the amplitudes (s(t) |i). It was 
shown that the amplitudes C;(t) satisfy the equation 


d 
-iñ Ci(t) =% Ø C(t) (18.2) 


which permits one to find, from a knowledge of the amplitudes C; (t) 
and the Hamiltonian matrix Hj;(t) at time t, the amplitudes C; at any 


subsequent moments of time. 


The expression (18.1) clearly shows that the set of amplitudes {C;(t)} 
depends on the choice of the system of basic states {( i |}. Suppose we 
have to go over to a new system of basic states {( m |}. In order to accom- 
plish this transition, we represent the earlier basic states in the form of 


superposition of new basic states: 


(il = )° (ilm) (m| (18.3) 
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Substituting this superposition in (18.1), we get 


(s(ÐI =D) 2, CO (ilm) (m| 
or 
(5(t)] = ¢m(t) (m| (18.4) 
where 


Cm(t) = Di cilt) (ilm) (18.5) 


The new amplitudes cm(t), corresponding to the system of {( m |}, satisfy 
an equation of the type (18.2) but obviously with a new Hamiltonian 


matrix Hmn (t): 


-ih Ë em (t) = È Hma(0) on(t) (18.6) 


We shall show how the old Hamiltonian matrix can he expressed in terms 


of the new one. Substituting (18.5) in (18.6), we find 
„d RRE l 
-ih =. Li (Im) = 2, 2 EmO C) (jln) 


Multiplying both sides of the last equation by (m | i} and summing with 


respect to m, we get 





d 
m 20 Dlm) cm 


-DE Em0 (jln) (m|i) 
j n m 


Taking into account that 


C;(t) 





SiH Lm) (mi) = Ci Li) = by 
we can simplify the left-hand side of the last equation. As a result, we get 
„d 
-ih = Ci(t) = X HOCA) 
J 


where 


Hy = X, J Hmn(t) Gln) (mli) (18.7) 


Thus the choice of any system of basic states of a microparticle influences 
both the form of the amplitudes of states satisfying (12.8) as well as the 


form of the Hamiltonian matrix. 


It was mentioned in Section 13 that the system of basic states is usu- 
ally chosen in such a way that these states could have a direct physical 
meaning. However, for the sake of greater convenience when consider- 


ing equation (12.8), we go over to new, specially selected basic states in a 
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number of cases. Thus, in Section 13, the transition from the basic states 
{1| and {2| to the basic states (I| and (I | was motivated by a desire to 
diagonalize the Hamiltonian matrix, while in Section 14 it was done in 
order to express the Hamiltonian matrix in a form convenient for drawing 


an analogy with an electron in magnetic field. 


Let us consider operator H(t) satisfying the following relation: 
HO) = GIAO i) (18.8) 


The operator H(t), acting on the basic state (i |, gives rise to a new state 
(w(t) | = H(t) |i), which is not basic. The element of the Hamiltonian 
matrix Hj;(t) here plays the role of the amplitude ( j | Y(t) ), i.e. of the 
amplitude probability that a microparticle in the basic state ( j | may be 
found in the state (/(t)|. Substituting (18.8) in (18.2), we get 


-nt (3091) = X GOID GOLD (18.9) 


or 
d 1 
-ih q; S |i) = (s(t) H(i) (18.10) 


Taking the complex-conjugate of the equation and considering (9.33) and 


(12.9), we get 
d 4 
“ih 7 CEIS) ) = CA) | s(t) (18.11) 


or 


-nt Ist) = H(t) |s(t)) (18.12) 


Note that while going over from (18.9) to (18.10) or from (18.11) to (18.12), 
we have made use of the rules given in Section 10 under the heading 
“The Mechanics of Quantum Mechanics”. The final equation (18.12) is 
analogous to equation (18.2), but unlike the latter, it is independent of the 


choice of basic states. 


In order to express the causal relationship mathematically (18.2) in 
the above discussion, we had to use information about the state under 
consideration, the physical meaning of the problem and the system of 
basic states. But now (18.12) it is sufficient to have information about the 
state under consideration and the physical meaning of the problem. This 
means that a transition from (18.2) to (18.12) leads to a more abstract level 


of analysis. 


We emphasize that such an abstraction is made possible by the use of 
operators. Operators help us to get quantum-mechanical relations that 
are more abstract and are independent of the choice of the system of basic 


states. 


The more abstract analysis, enabled by the introduction of operators 


Transformation of an Equation 
Expressing Causality Into 
a Form Independent of the 


Choice of Basic States 


Vector Analogy 
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is, to a certain extent, analogous to the more abstract analysis connected 


with the introduction of vectors. We shall consider this analogy in detail. 


This analogy is not unexpected. Remember that in Section 14 it was 
suggested that a state should be treated as a vector in some arbitrary 


space. The concept of projection amplitudes was also introduced there. 


The essence of the vector analogy is as follows. In order to perform 
operations with components of vectorial quantities one always has to 
choose a definite system of coordinate axes. In quantum mechanics we 
make a corresponding choice of a definite system of basic states. By 
using vectors, one can perform operations with vector quantities without 
resorting to a choice of any particular system of axes. Similarly, by using 
vectors of states and their operators in quantum mechanics, one can avoid 
having to choose a system of basic states. Let us illustrate the vector 


analogy with the help of the table below: 


Quantum-mechanical expression Vector Analogy 


Df li) (ils) =0 Yai bi = 0 


L 


(fis)=0 (ab) =0 


(Mutually orthogonal states) (Mutually orthogonal vectors) 
es 
in lO) = DH (j|s(t)) c= > Hy a; 

j J 


in [s()) =Als(0)) č=Ĥā 


(Operation on a state) (Operation on a vector) 


Each cell of the table contains two expressions having the same mean- 
ing (indicated in brackets). However, the upper expression depends on 
the choice of the system of basic states or coordinate axes, while the lower 


one is independent of any such choice. 


It is worthwhile giving an example for the lower right cell of the above 
table, which shows how one can imagine an operator acting on a vector. 
As an example of this (to which, of course, no “quantum-mechanical 
meaning” should be assigned) let us consider the case when the matrix Hj; 


is of the form 


ð ð 
0 pape ee 
or3 ore 
(x) Z ð ð ð 
z Or3 ory 
ð ð 
-— — 0 
Ore ory 


where r1, r2, r3 are three Cartesian coordinates in space. In this case the 
relation č = Ha acquires the form which is familiar to those who have 


studied vector analysis: ¢ = V x a. 
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In conclusion let us reiterate the various aspects of the vector analogy: 


(a) corresponding to the choice of basic states, we have a choice of the 
system of coordinate axes; 

(b) a transition from one set of basic states to another corresponds to 
a transition from one system of axes to another (note that such 
a transition does not involve the physics of the problem being 
considered); 

(c) the expansion into basic states corresponds to a representation 
of the vector in terms of its projections on the coordinate axes. 
The analogy with vectors is a good example of abstraction and of 
the representation of relations in a form taking into account only 


strictly physical information. 


Let us demonstrate some of the advantages of the operator approach. We 
shall show how the mean value ( E) of the energy of a microparticle in 
some state ( s | is determined. Let {( i |} be the basic states with energies 
E;. This means (see Section 13) that the Hamiltonian matrix is diagonal. 
Hence, 

(jl Hi) = by Ei 


or 
(jl H|i) = (ili) Ei (18.13) 


We expand the state (s | in terms of the system of basic states 
{il} (s]= J (sli) ci 


and try to find the mean value of the energy (E Y with the help of a for- 
mula of the type (12.3): 


(E)= J Gs PE, (18.14) 
i 
By taking (9.33) into account, we rewrite 18.14 in the form 


(E) = ) GIDE (ils) = (slp) (18.15) 


where 


le) =J li) E (ils) (18.16) 
From (18.13) it follows that 
Hli) = |i) E; (18.17) 
Substituting (18.17) in (18.16), we get 


le) = DA li) (Gls)=Ëls) 


Average Energy 
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after which we get the required result from (18.15): 
(E) =(s|H|s) (18.18) 


It can be seen from (18.18) that mean value of the energy of a microparti- 
cle in the state (s | is expressed only through the operator H. Basic states 


do not enter this relation. 


The convenience of the relation (18.18) is due to its independence 
from the choice of basic states, which allows one to use freely any system 
of basic states when carrying out calculations. For example, suppose it 
is convenient to use the basic states {({m1)}. In this case the operator 


relation 18.18 immediately acquire the corresponding form 
(E) =)" )\(s|m)(m| Hn) (n]s) (18.19) 


It has been mentioned above that the Hamiltonian matrix could be called 
the energy matrix (remember that the elements of the diagonalized Hamil- 
tonian matrix are essentially the possible values of the energy of the 
microparticle). The connection between the operator H and the Hamilto- 
nian matrix as well as relation (18.18), expressing the average energy of a 
microparticle in terms of the operator H, justify the name energy operator 


given to it. In the literature the operator it is also called the Hamiltonian. 


We write expressions derived above in which the Hamiltonian of a 
microparticle is present (we emphasize the exceptional importance of 


these expressions): 


in |s()) Ås) (18.12) 
(B) = (s|FiIs) (18:18) 
AIP) =If)E (18.20) 


In expression (18.20), ( f | denotes some stationary state and E is the 


energy in this state. 


Finally, we note that the Hamiltonian (as well as any other operator) 
may act not only on the state | s }, but also on its amplitude (i| s }, since 


we always have the representation [see (17.4)] 
FC,(t) =) Hy C;(t) (18.21) 
j 


Using (18.21) and taking into account that (i |s} = Cf, we can rewrite 
(18.2) in a form which, as can be easily seen, is completely analogous in 
form to (18.12): 

in © (i|s(t)) = Hil s(t)) (18.22) 


Energy Operator (Hamilto- 


nian) 
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Correspondingly, (18.20) may he written in the form 


Hilf) =Eilf) (18.23) 


19 Linear Operators in Quantum Mechanics 


We now come to the main problem of this chapter, i.e, that of impart- 
ing physical meaning to the mathematical apparatus of linear operators in 
order to convert it into the apparatus of quantum mechanics. In this sense 
the previous section should be considered as a preliminary step towards 


solving this problem. 


The following two points must be noted when considering the role of Role of Operators in Quantum 


linear operators in quantum mechanics. Mechanics 


Firstly, in quantum mechanics to every dynamic variable (spatial coor- 
dinate, energy, momentum, angular momentum, etc.), there is 


assigned a definite Hermitian operator. 


Secondly, the transition from one representation to another without 
changing the physical meaning of the problem is achieved with the 


help of unitary operators. 


Let us consider the first point in detail. It means that besides the energy 
operator H, other “physical operators” like the coordinate operator ? the 
momentum operator Ps the angular momentum operator M, etc. must be 
introduced. In this respect, it is significant that the well-known dynamic 
relations of classical mechanics may be transferred to quantum mechanics 
in the same form, if we replace the physical quantities in these relations 

by the corresponding Hermitian operators. In other words, the apparatus 

of quantum mechanics may be built up in analogy with the apparatus 

of classical mechanics, if we replace the dynamic variables with their 
corresponding Hermitian operators. As an example, let us compare the 


following expressions: 





In classical mechanics In Quantum Mechanics 
2 52 
E= +U Hs! +Û (19.1) 
2m _ 2m 
M = (7x) M= (F xp) (19.2) 


It should be remembered, however, that a complete formal analogy 
between the apparatus of classical and quantum mechanics does not exist. 
We point out (for details see Section 20) that in the algebraic manipulation 


of operator relations we must remember that operators may not commute. 
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x à a aya h rare à 
Thus if A and B do not commute, then (å + b) + A? + 2AB + Ê?. In this 


a a\2 a ro a E 
case (4 + Ê) = Â? +AB+BA+B’. Besides, there are operators in quantum 
mechanics which do not have classical analogies (for example, the spin 


operator). 


Note that we may formally assign corresponding operators to all clas- 
sical dynamical variables including those which do not have any meaning 


in the microworld. 


Thus, we may introduce the operators of velocity 3, of potential energy 
U, kinetic energy T, etc., though neither velocity nor the breaking-up of 
total energy into kinetic and potential components which is characteristic 


for classical mechanics, have any meaning for microparticles. 


Let us consider the question: In exactly what way can we compare a Basic Postulates 
physical quantity with a Hermitian operator? In other words, what is the 
meaning of the word “compare” here? The following two basic postulates 


provide an answer to this question. 


Postulate 1. If the operator Ê is compared with a physical quantity l, it 
means that the eigenvalues A of the operator are identified with 
the values of the physical quantity under consideration obtained by 


measurement. 


Postulate 2. If operator Ê is compared with a physical quantity L, it means 
that the eigenfunctions y, (æ) of the operator are identified with 
the eigenfunctions of the quantities from the A-set, expressed in 
a-representation. By the remarks made in Section 15, this means 
that the eigenfunctions y, (æ) of the operator are identified with 


the amplitudes of (åA | æ }, often called the wave functions. 


Thus a study of the fundamental equation of the theory of linear opera- 
tors [see (17.13)] 
L(a) (Ala) =AAla@) (19.3) 


includes physical problems such as finding the spectrum of possible val- 
ues A of a physical quantity J, and finding the amplitudes of states (A | a) 


in which the corresponding values of À occur. 


When applied to the energy operator, (19.3) takes the form of (18.23). 
A study of equation (18.23) permits us to find the possible values of the 
energy of a microparticle and the corresponding amplitude values of the 


stationary states. 


The postulates formulated above “knit together” the physical and math- Mathematical Results and 
ematical aspects; they “load” the mathematical symbols and inferences Their Physical Meaning 


with a definite physical meaning. We shall demonstrate this with a num- 
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ber of observations. 


1. The eigenvalues of a Hermitian operator are real. From a physical 
point of view this means that the values of quantities obtained 


during measurements are real. 


2. The spectrum of the eigenvalues of a Hermitian operator may 
be discrete or continuous. This corresponds to quantization or a 
continuous variation of the physical quantities characterizing a 


microparticle. 


3. The eigenfunctions of a Hermitian operator satisfy the condition of 
orthonormalization. This mathematical fact becomes the condition 
of orthonormalization of eigenfunctions of physical quantities 
and, in particular, the condition of orthogonality of basic states of 
a microparticle. In other words, the mathematical result (17.16) 
is converted into physical relations (15.13) and (10.8), while the 
mathematical result (17.17) is converted into the physical relation 
(15.14). 


4. The system of eigenfunctions of a Hermitian operator is a closed 
(complete) one. From a physical point of view, this means that 
it is possible to expand any amplitude in terms of amplitudes 
which are eigenfunctions of a physical quantity, ie. in terms of 
basic amplitudes. In other words, the completeness of a system 
of eigenfunctions of a Hermitian operator is converted into the 


physical principle of superposition of states. 


5. The eigenvalues of a Hermitian operator may be degenerate. Phys- 
ically this means that one value of a quantity may correspond to 


several different states. 


6. The mathematical result (17.28) for the eigenfunctions of operators 


corresponds to the physical result (9.33) for amplitudes of states. 


7. Unitary transformations corresponding to the transition from one 
representation to another, physically correspond to a transition 
from one complete set of quantities to another and, in particular, 


from one system of basic states of a microparticle to another. 


8. The existence of a common complete system of eigenfunctions 
means that the operators commute. This mathematical fact cor- 
responds to the possibility of simultaneous measurement of the 
corresponding physical values. Note that the impossibility of the 
simultaneous measurement of such physical quantities as the coor- 
dinate and momentum of a microparticle means that the operators 


of coordinate and momentum do not commute. 
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9. The mathematical fact of commutativity of the Hamiltonian H and 
the operator Ê means that, physically the quantity | corresponding 
to the operator L is an integral of motion. In other words, the 
condition 

[H, L] =0 


from the physical point of view is the law of conservation of the 


quantity l. 


The last remark will be rigorously proved below. Here, we shall just 
mention some ideas of a qualitative nature for this purpose. If operators 
H and L commute, the quantities E and / can be simultaneously measured 
since there are states in which both these quantities have definite values. 
A state in which energy has a definite value is stationary, i.e. has an 
infinitely long “life” time. But this means that the quantity l must also be 
conserved for an infinitely long time, just like any other physical quantity 


for which the given state is an eigenfunction. 


If the quantity l is measured in a state described by the amplitude 

(A |a), then, according to basic postulates, the measured value will be À. 
We assume now that the quantity l is measured not in its eigenstate, but 
in some “other” state, for example, in the state described by the amplitude 
®,(a) = (s|a@). In this case the result of a single measurement cannot be 
predicted unambiguously; probabilistic predictions enter into the picture 
now, thus permitting an estimation of the mean value ( A) from a rela- 
tively large number of measurements (in this connection see Section 12). 
We shall show how to compute the mean value (4A } in the state (s |, if 
we know the Hermitian operator Ê corresponding to the quantity L, Note 
that in the particular case when energy is used in place of the quantity l, 
this problem was considered in Section 18, where the following result was 
obtained: 

(E) =(s|H|s) (18.18) 


In the general case, (18.18) takes the form 
(A)= 7 D(a) L(a) O(a) da (19.4) 


(we assume for definiteness in this case that the variable a, which charac- 


terizes the representation varies continuously). Let us prove this. 


We expand the amplitudes ©,(@) and ®ž (æ) in terms of eigenfunctions 
Wn(a) = (An |æ} of the operator L (we assume that the spectrum of the 


operator L is discrete): 
(a) = X, (slår) Yala) 
Pila) = X, (s lAm)" Yin(@) 


Mean Value of a Quantity 
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Substituting these superpositions in (19.4) and using (19.3) and (17.16), we 
get 


feo ite P, (æ) da 
= DY Csa) (slam) f Ya) Ela) Yala) da 


=D $, C514a) (slAn) An f Yl) ala) da 
= S151 (san) Aa Andmn 
= $, Ks An)? an 


By (12.3), the last sum is equal to (1), Q.E.D. 


The result (19.4) is very important. In fact this one result is sufficient 
to demonstrate the usefulness of the application of operators in quantum 


mechanics. 


In analogy with (18.18), the result (19.4) can be written in a more ab- 


stract form which avoids a choice of representation: 


(A) =(s|L|s) (19.5) 


Using (19.5) and assuming at the outset that the operator L is indepen- 


dent of time, let us write 


d d r 
7) = 7 GOILO) 


i ; aT (19.6) 
= (5 (s(t) ) |L| s(t) ) + (s(t) | LI (5 | s(t) | 
Further, we turn to (18.12) and transform it into the form 
in (s(t) | = (s) | 
i J s =(s 
or, taking into account the hermiticity of the Hamiltonian, 
d : 
-iñ (s(t)| = (s(t)|H (19.7) 
Substituting (18.12) and (19.7) into (19.6), we get 
2 (ay = 4 (s(t) IAE- LA s(0)) 
dt =a 
or 
# (ay=4(s! [AE] Is) (19.8) 
dt Ta eae i 


d 
If the quantity / is an integral of motion, FF (A) = 0. It follows from 
(19.8) that the above-mentioned condition [H, [| = 0 is a condition for the 


The Variation of the Mean 
Value of a Quantity with Time 
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conservation of the quantity 1. 


We introduce a new operator L, defining it by the relation 


s d 
(sli Is) = z 4 (19.9) 


Comparing (19.9) with (19.8), we conclude that in the case when Lis 
independent of time, the operator L is of the form 
i 


L=- [Af] (19.10) 


=~! 


If Ê varies with time, we get instead of (19.10)) 


i 


h 


Ê (19.11) 


œ> 


L= [H,L] + 


|% 


The physical meaning of any result is obviously independent of the choice 
of a representation. In other words, the physical meaning of a result 
should not change upon a transition from one representation to another. 
Since these transitions are accomplished by means of unitary transforma- 
tions, it means that physical results must enter the mathematical apparatus 


as unitary invariants. 


Thus, the requirement of unitary invariance of corresponding results 
may serve as an additional criterion of the correctness of the basic pos- 
tulates formulated above and the consequences that follow from them. 

In this connection, we first mention the fact of unitary invariance of the 
property of hermiticity of an operator mentioned in Section 17 also the 
unitary invariance of the spectrum of eigenvalues of a Hermitian operator. 
It is easy to see that the commutator [H, È] is also a unitary invariant and, 
consequently, the condition of conservation of any physical quantity is, 

as expected, independent of the choice of a representation. We mention 
further that unitary invariance of the relation J Yi Ym dx now indicates 
the independence from the choice of representation of the condition of or- 
thonormalization of eigenfunctions of a physical quantity. Finally, unitary 


invariance of the expression 


/ * (x) L ®(x)dx 


indicates the independence from the choice of representation of the mean 


values of physical quantities. 


Unitary Invariance of Physical 


Results 


Note that the result (19.4) could have been obtained unambiguously from the requirement of the 


unitary invariance of the quantity (A) using the expression (18.18) obtained for a special case. Indeed, 


from the requirement of unitary invariance it follows that (2) must be represented by an expression of 


the type 
J * (x) Ê” (x) P(x) dx 


and a comparison with the particular result (18.18) indicates that in this case n must be put equal to 1. 
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20 The Quantum-Mechanical Apparatus in Coordinate Repre- 


sentation 


A further treatment of the apparatus of quantum mechanics requires 
a knowledge of the definite form of the operators of various physical 
quantities. For this it is essential to choose a definite representation. Let 


us choose the coordinate representation for this purpose. 


Note the importance of finding the form of the two basic “physical op- 
erators”, the coordinate and the momentum of a microparticle. Knowing 
these operators, we may obtain the energy operator [see (19.1)] and the 


angular momentum operator [see (19.2)]. 


For simplicity we shall consider one-dimensional motion along the x- 
axis (the result so obtained can be easily generalized to a three-dimensional 
case). Taking into account the remarks made in Section 17 about the form 
of Hermitian operator in its eigen representation [see (17.26)], we con- 
clude that the operator of a coordinate in the coordinate representation is 
the coordinate itself: 

x(x) =x (20.1) 


This result can be generalized to any coordinate function: 
U(x) = U(x) (20.2) 


Let us now try to find the form of the momentum operator. First we shall 
prove the following theorem. Suppose an operator O somehow transforms 
a coordinate. If the Hamiltonian H remains invariant under this transfor- 

mation, the operators O and H commute. 


Proof: Let Ox =x’; we act on H(x) (x) with the operator Oto get 


OH(x) y(x) = A(x’) y(x’) 
= H(x) p(x’) 
= H(x) Ô y(x), 


which proves the theorem. 


Suppose the operator O is the operator of infinitesimal translation 
along the x-axis: Ô y(x) = y(x + Ax). Making use of the smallness of the 


translation, we write 


W(x + Ax) = 





1+ are W(x) 


Thus 
O=1+A 4 
~ “Tx 


Proceeding from the properties of homogeneity of space, we conclude 


that an operation by Ô must leave the Hamiltonian of a microparticle 


Operators of Coordinate and 


Momentum 
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invariant. Hence, according to the theorem proved above, we obtain 


CEC |a =0 

dx 
But it was shown above in Section 19 that a commutation with the Hamil- 
tonian expresses the law of conservation of a physical quantity. This 
means that T is the operator of some physical quantity which is con- 
served. We know that momentum is a quantity whose conservation is a 
consequence of the homogeneity of space (see Section 1). Consequently, 
the operator T must coincide with the momentum operator of a mi- 


croparticle up to some constant factor: 


— = yfx (20.3) 


The factor y is determined from a consideration of the limiting transition 
from quantum mechanics to classical mechanics (see Appendix B). It 

i 
can be shown that y = —. Thus the operator of the x-component of a 


microparticle momentum in the coordinate representation has the form: 


bx = -nt (20.4) 


The results (20.1) and (20.4) can be easily generalized to a three-dimensional 
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case: 
=F (20.5) 
P =-ihV (20.6) 
Using (20.4), we can write an equation for the eigenfunctions of the x- Eigenfunctions of Momentum 
component of the momentum: 
d 
-iho 0 = Ps Vou) (20.7) 


It can be easily seen that (20.7) can be solved for any values of the param- 
eter px. Consequently, the momentum of a microparticle is not quantized 


(the spectrum of the eigenvalues of the momentum is continuous). 


From equation (20.7) it follows that the eigenfunctions of operator px 


have the form of plane waves: 





Wp, (x) = A exp ( Pr =) (20.8) 


To determine the factor A, we make use of the condition of orthonormal- 


ization (17.17)): 
J Wp Woa) dx = Slps = po) 
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Substituting (20.8) into this equation, we find 


al f exp (PRP?) = aps — 90) 


Further, taking into account (15.17), we obtain 
i exp {ete dx = 27 h d(px — pl) 


1 
Comparing the last two equations, we get A? = a Consequently, 
T 








yy, (x) = i exp (2 =) (20.9) 


A generalization for the three-dimensional case gives 


yz) = (=a) exp (7) (20.10) 


Note that the eigenfunction of the momentum (20.10) coincides with the 
wave function (15.15) derived in Section 15 for a freely moving microparti- 


cle. 


Let us consider the equation (18.23) for eigenfunctions of a Hamilto- Schrodinger Equation 
nian: 


Ê og (x) = E gr(x) (20.11) 


Using (19.1) for the Hamiltonian of a microparticle moving in an external 


field with potential U(x), and taking into account (20.2) and (20.4), we get 


2 2 


^ h 


Substituting (20.12) into (20.11) we get 


h? a 


-= Z Pela) + [U (x) - E] gz(x) =0 (20.13) 


This is the one-dimensional Schrödinger equation. Generalizing it for the 


three-dimensional case, we write 


2 
-Ag + (UG) - E] gF) =0 (20.14) 
m 


Knowing the functions g(x), we may write the expressions for ampli- 
tudes of stationary states ¥g(x, t), since the time dependence in this case 
has the universal form discussed in Section 13. Using (13.4) and taking 


into account the fact that 
eet) =< Elta Ey =C) (20.15) 


we get 
—iEt 
YPz(x, t) = g(x) exp (=) (20.16) 
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It can be easily seen that the functions Yz(x, t) are the solutions of equa- 
tion (18.22), where the Hamiltonian (20.12) has been used in place of H. In 
this case, the equation has the form 

K o 


a] 
iħ—Y¥ = -— —~ Y¥+U(x)Y 20.17 
"at 2m ax? (x) ( ) 


It is also called the Schrodinger equation More precisely, equation (20.13) 
is called the time-independent Schrödinger equation and the equation 


(20.17) is called the time-dependent Schrödinger equation. 


Schrédingers contribution was that he guessed (yes, guessed!) how to 
write the Hamiltonian of a microparticle in the form (20.12). It is true that 
in our discussions (20.12) does not appear unexpectedly, it appears here as 
a consequence of results (19.1), (20.2) and (20.4). It should, however, be re- 
membered that (19.1) has not been derived here, in fact it was postulated 
(more precisely, an analogy between classical and quantum-mechanical re- 
lations was postulated). When Schrédinger proposed his famous equation, 
this analogy was not yet apparent. Moreover, the result (20.12) itself, as 
we shall see below, served as the basis for this analogy. (For establishing 
the relation (20.17), we may make use of the transition from quantum to 


classical mechanics - see Appendix B.) 


By using (19.2) and (20.4) we can easily get the expressions for the 


operators of projections of the angular momentum: 


R ð ð 
Nig = in (y 2-25) 


Oz oy 

E ð ð 
M, = —iħ |z — - x — 20.18 
” i | ox i z) ( ) 

“ ð ð 

Mz = —iħ | x — — y — 

oy Ox 


The operator of the square of angular momentum is given by the expres- 
sion 
2 iR ny a yy? 
M° = M; + M; +M; (20.19) 
While considering the angular momentum operators, it is convenient to 


use spherical coordinates r, 0, o instead of Cartesian coordinates x, y, z: 


x = r sin 0 cos Q 
y = r sin 0 cos Q (20.20) 


z=rcos0 


ð 
Using (20.20), we represent the derivative PA in the form 
p 





a, ow ax , oy dy _ W Oz 
Op ~ ax 0p oy Op az OY 


Operators of the Angular Mo- 
mentum Projections and the 


Square of Angular Momentum 
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ð 
= —-— rsin sino + N tup 
Ox oy 


ð ð 
=(:5-93)y 


From this, with the help of (20.18), we get 


= n2 20.21 
ha (20.21) 


ð 
In a similar way from the derivative a y we find 
Mx + iM, = h(exp{+io}) i eon” (20.22) 
x + iM, = h(exp{ti +— + icot0— . 
y Pi=lp 30 ap 


from which, with the help of (20.19), we get 


` 1 æ 1 ə 
M = -h° — 6 20.23 
E ap? * Sin 00 (si iia 5]| ( ) 








These relations form the commutation rules for the operators of coordi- 
nate, momentum and angular momentum of a microparticle. Denoting 
the Cartesian components of these operators by subscripts i, j, k, we can 
write these commutation rules (it will be shown later on how they may be 


derived): 





[A A] = 0 (20.24) 
[pi Pi] = (20.25) 
[bi f] = —ihd;; (20.26) 
x Ry 0 hs 
[Pi FO) = mie (20.27) 
[Mi Fj] = th 2 eijkfk (20.28) 
[M pj] = ih 5 eijkpk (20.29) 
[Mi Mj] =ih 5 eijkMk (20.30) 
k 

Here e;;, is a unit antisymmetric tensor of the 3rd rank, e123 = e231 = 

€312 = 1, €132 = €321 = €213 = —1, and the remaining 21 components of this 


tensor are equal to zero (in these components at least two subscripts have 
the same value). It can be easily seen that in summations with respect to k 


not more than one term is present. 


In the coordinate representation f; = r;; therefore, the result (20.24) 
is obvious. It means that all three coordinates of a microparticle can be 
measured simultaneously as was mentioned in Section 3, they occur in the 


same complete set of quantities). From (20.4), we write 








a a 


5. pf: =— 2 = 
[Pi PilY ü A ðrjðri 


Commutation Relations 
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Since the value of the mixed derivative is independent of the order in 
which the differentiation is carried out, it follows that [p;, p;] = 0. This re- 
sult means that all the three components of momentum can be measured 


simultaneously (they belong to the same complete set of quantities). 


The result (20.26) is derived in the following way: 
TOR . ð ð Oi . 
[pis ]y = -iñ A mame et Sr ay, 


It means that components of momentum and coordinate which have dif- 
ferent subscripts can be simultaneously measured, while components with 
like subscripts are unmeasurable, in complete accord with the uncertainty 
relations for coordinate and momentum of a microparticle described in 
Section 3. The result (20.26) also indicates that the three components of 
coordinate and the three components of momentum enter in different 


complete sets of quantities. 


The result (20.27) is a generalization of (20.26). In fact, 


sisal Uh ko el 
[b flv = a| Zew Foy} = ney 


Results (20.28)-(20.30) can be obtained from (19.2)) by using the preceding 


commutation relations (see Appendix C). 


Results (20.28) and (20.29) mean that components of the angular mo- 
mentum and coordinate (angular momentum and momentum) with like 
subscripts can be measured simultaneously, while those with different 
subscripts cannot be measured simultaneously. These results also mean 
that projections of the angular momentum cannot belong to complete sets 


which include coordinates or the momentum components. 


The result (20.30) means that different components of angular momen- 
tum don’t have a common closed system of eigenfunctions and cannot 
appear in the same complete set of quantities. Considering the examples 
discussed earlier, we can conclude that different components of angu- 
lar momentum cannot be measured simultaneously. This conclusion is 
correct, but it should be slightly improved on the basis of the example 
of the components of angular momentum. As a matter of fact, there is 
one case when all the three components of angular momentum can be 
simultaneously measured - this is the particular case when all of them are 
equal to zero. This case, however, does not essentially change anything 
since, as has been remarked in Section 17, the presence of one common 


eigenfunction is in no way connected with the commutation of operators. 


By using relations (20.19) and (20.30), we can establish one more rule 
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for commutation (see Appendix D): 
[M2, M] =0 (20.31) 


This means that we must include the square of the angular momentum 
and anyone of the projections of the angular momentum in the same 


complete set of quantifies. 


Note that the simultaneous measurability of all the components of momentum and the impossibility 


of similar measurement for the angular momentum components have a very simple explanation. The 


fact is that the parallel translations associated with the momentum operator are commutative, while 


the rotations associated with the angular momentum operator are non-commutative. It is immaterial 


whether we move first along the x-axis and then along the y-axis, or in the reverse order. 


However, the sequence of rotations is certainly not immaterial. Take, for example, a point on the 


z-axis and make two successive rotations through 90° - in one case first around the x-axis and then 


around the z-axis, in the other ease first around the z-axis and then around the z-axis. It can be easily 


seen that the final positions of the point are different in these two cases. 


The inversion operator P is defined in the following way: 
Py(7,t) = Py(-7,t) (20.32) 


where P is a constant. By applying the inversion operator twice, we obvi- 
ously return to the initial function y (F, t). It follows from this that P? =1, 
Le. 

P=+1 (20.33) 


The quantity P is called the spatial parity. If P = 1, and consequently, 
Py(7,t) = (-7,t), the microparticle is said to possess even parity. If, 
however, P = —1 and, consequently, Py, t) = -/(-7, t), the microparti- 


cle is said to possess odd parity. 


Suppose [P,H] = 0. In this case, according to (19.10), the parity is 
a conservable quantity. If at the initial moment of time the microparti- 
cle was, for example, in a state with even parity, it must have the same 
parity at subsequent moments of time (which, of course, imposes certain 


restrictions on the possible changes in the state of the micro-particle). 


It was mentioned in Section 1 that the laws of conservation of energy, 
momentum and angular momentum are the results of definite properties 
of the symmetry of space and time. The law of conservation of parity is 
no exception to this. It is a consequence of the symmetry with respect 
to inversion operation, which, as can be easily seen, reduces to a combi- 
nation of the operation of rotation and reflection [in fact, the operation 
(x,y,z) — (-x,-y, -z) consists of rotation through 180’, say, around 
the z-axis, and reflection in the plane perpendicular to the z-axis]. Taking 


into account that the conservation of angular momentum is linked with 


The Inversion Operator; Parity 
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rotational symmetry, we conclude: the conservation of spatial parity is 
associated with the fact that physical processes take place identically in 


the “usual world” and the “mirror world” 3 


Let us write the equation for the eigenfunctions of the operator M, 
defined by (20.21): 
ð 
-ih—wy = M, 20.34 
hV = Mey (20.34) 


The solutions of this equation are of the form of the 





iM: 
Jlo) = Aexp $ e) (20.35) 
The function y is periodic: f(g + 2m) = Y(ọ). Consequently, 
M, = him, m = 0, +1, +2 (20.36) 


The reader is already familiar with this results: the projection of angu- 
lar momentum is quantized; it assumes values differing by multiples of 
Planck’s constant (see Section 2). The factor A in (20.35) is determined 


from the normalization condition 


| Yates 


It is easy to see that A = (2x)-"/?. Thus 


Ym () = (2x)? exp(ime) (20.37) 


We now turn to the operator M. Using (20.23), we write 


1 æy 1 ə əy 
—h in 0 =M? 20.38 
sin? 0 ay? * Sind 90 [sn || y ( ) 





This equation is known in mathematics as the equation for spherical func- 


tions. It has bounded solutions under the condition that 
M =h? l(1+1), 1=0,1,2, (20.39) 


Assuming that the condition (20.39) is satisfied, we write the solutions of 


(20.38) in the form of spherical harmonics: 


Yim = (Ep (cos 9) exp(img) (20.40) 


where m = 0,+1,..., +l. Taking (20.31) into consideration, we conclude 
that the eigenfunctions of M: and M2 are common. Hence, m should be 
treated as a magnetic quantum number corresponding to the projection 


of the angular momentum of the z-axis. It assumes 21 + 1 values (from 


3Tt must he noted that in some pro- 
cesses involving elementary particles, the 
symmetry of “usual world” and “mirror 
world” is violated. It was shown by Wu in 
1957 that in experiments on the -decay 
of nuclei, spatial parity is not conserved. 
This result was predicted in 1956 by Lee 
and Yang. 


Eigenvalues and Eigenfunc- 


tions Operators Mz and M? 


190 BASIC CONCEPTS OF QUANTUM MECHANICS 


-l to l). The functions pm (cos 0) appearing in (20.40) are essentially the 


associated Legendre functions. We remind the reader that 
m 2 die d™ 
P(x) = (1- x°) Gym) (20.41) 


where P;(x) are Legendre polynomials *: 


L 


Pix) = Tn ad 


= [° - 1] (20.42) 


The spherical functions Yim(0, p) are orthonormalized: 


2m n 


J J Vin (0, P) Vim (0, p) sin 0 dO do = Sy mw (20.43) 
0 


0 


If the result (20.36) is known, we can derive the result (20.39) by assuming 
that 
M? =3({ M2) = 3h" (m°) 


The mean value (m° ) is determined by the expression 


Í 
(m°) = Da 2+1 D 


m=—1 m=0 


m2 








1 
= sl(l+1) 


This leads directly to (20.39). 


It follows from (20.18) that the inversion operators commute with 
operators of any projection of the angular momentum. Moreover, the 
inversion operators and the operators of the square of the angular momen- 
tum commute: 

[P,M;]=0, [P,M2] =0 (20.44) 


This mean that the operators P and M; have a common complete system 
of eigenfunctions. The same applies to the operators P and M2. From this 
it follows in particular that a state with a definite orbital quantum num- 
ber / must also be characterized by a definite spatial parity. In spherical 


coordinates the inversion is of the form 
ror @>27-0 gr OtT. (20.45) 


Using (20.40)-(20.42) we find that for such a transformation the function 
Vim (8, p) is multiplied by (—1)!: 


Vim => (=1)' Yim (20.46) 


It follows hence that states with even l have an even parity while the 


states with odd l have an odd parity. 


It is appropriate here to recall that the example given in Section 3 for 


the complete set of quantities describing the state of a photon includes 


Legendre polynomials and associated 
functions are considered in Appendix D. 


Parity and Angular Momen- 


tum 
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M°, M; and P [see (3.7b). Note that the parity and the angular momen- 
tum occur in the same complete set of quantities. Formally, this is a 
consequence of relations (20.44). However, one can start from consid- 
erations based on direct physical intuition. In fact, the obvious “affinity” 
between the parity and the angular momentum is connected with the 
above-mentioned fact that the inversion operation includes rotation in 
addition to reflection, The order in which these operations are carried out 
is immaterial; rotation can follow reflection or, the other way round, it 


can precede reflection. 


So far, we have several times used the fact that the apparatus of quan- 
tum mechanics is based on the well-known equations of classical mechan- 
ics written, however, in operator form. This fact is so important that it is 


appropriate to return to it once again. 


We have mentioned above the brilliant guess by Schrédinger, who pro- 
posed the expression (20.12) for the Hamiltonian of a microparticle. If we 
use this guess and the fundamental result (19.10), we can easily see that 
the equations of classical mechanics can be actually transferred to quan- 
tum mechanics by replacing the physical quantities by the corresponding 
Hermitian operators. Substituting the operator Ê = x in (19.10), and using 


the expression (20.12) for the operator H, we get 
xa (20.47) 


This is the well-known classical result: the velocity is equal to momentum 
divided by mass. In quantum-mechanical interpretation this result means 
that the velocity operator is equal to the momentum operator divided by 
the mass. Further, we substitute the operator L = fẹ in (19.10). Using the 


expression (20.12) for H as before, we get 


d 
Px =~ U(x) (20.48) 
It can be easily seen that this is just Newton’s second law of motion, 


written in operator form. 


Remember, that the operators of the type x and px are introduced in 
accordance with the definition (19.9). Hence, it follows that the results 
(20.47) and (20.48) indicate the validity of the classical relations for the 


mean values of physical quantities: 


d 

ap 6%) = 45 | Be | 3) /m (20.47a) 
a = li 20.48 
PAS cs alae (x) (20.48a) 


Ehrenfest was the first to point out this and hence relations of the type 


(20.47a) or (20.48a) are called Ehrenfest theorems. In short, Ehrenfest’s 


The Relations of Classical 


Mechanics in Operator Form 
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theorems state that classical relations for physical quantities are trans- 
formed in quantum mechanics into relations for mean values of physical 


quantities. 


It was noted in Section 19 that the mathematical analogy between clas- 
sical mechanics and quantum mechanics considered here requires a cer- 
tain amount of caution since operators don’t always commute with each 
other. This leads to the conclusion that the information contained in the 
classical relations is insufficient for building up the quantum-mechanical 
apparatus. It is necessary to have additional information about the proper- 
ties of commutation of the operators under consideration. In other words, 
the classical relations must be supplemented by the commutation relations 


of the type (20.26)-(20.30). 


Thus, it is the commutation relations which carry the specific informa- 
tion essential to the apparatus of quantum mechanics. In this connection, 
we note that the right-hand sides of the commutation relations contain 


the specifically quantum-mechanical constant, i.e. the Planck’s constant. 


21 Applications of the Schrödinger Equation 


We shall mention three types of problems involving the solution of the 


Schrédinger equation. 


First type of problems. We consider the motion of a micro-particle 
in a limited region of space or, in other words, in a potential well (for 
example, the motion of an electron in an atom). Such a motion is called 
finite and the micro-particle is said to be in a bound state. In this case the 
time-independent Schrödinger equation is used [see (20.13) or (20.14)]. 
By solving the Schrodinger equation under certain boundary conditions 
imposed on the wave function and its first derivative”, the spectrum of the 
values of the energy of the microparticles and the wave functions of the 


stationary states can be found. 


Second type of problems. The infinite motion (motion unbounded in 
space) of a microparticle in an external field is considered. For example, 
the microparticle passes through a potential barrier (we recall the tunnel- 
ing effect mentioned in Section 4) or is scattered by some energy centre. 
Since the motion is infinite, the energy spectrum of the microparticle is 
continuous. By solving the time-independent Schrödinger equation, we 
can find the form of the wave functions of the microparticle far from the 
scattering centre (or barrier) from which, for example, the probability of 
scattering at a certain angle (or the probability of tunneling through the 


barrier as well as of reflection from it) can be calculated. 


Third type of problems. In the two types of problems indicated above, 


Some Characteristic Problems 


in Quantum Mechanics 


>The boundary conditions are consid- 
ered below during a discussion on specific 
problems. 
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we spoke of the stationary states of a micro-particle. For those the time- 
independent Schrödinger equation was used. The third type of problems 
involves a change in the state of a microparticle with time, and for this the 
time-dependent Schrédinger equation is used [see (20.17)]. By solving 
this problem we can find the probability of any quantum transition taking 


place under the influence of a given external force. 


Examples of problems of first two types are given in this section and 
in Section 24. The third type of problems can be found in Section 25. 
Naturally, we shall limit ourselves to just a few typical examples. But it 
should be mentioned here that the applied aspects of quantum mechanics 
are reflected quite comprehensively in the existing literature (in this 
connection, we recommend books by de Ter Haar and Fliigge® which give 


specially selected quantum-mechanical problems). 


Such a well is described by a potential U of the form 


co ifx <0, 
U(x)=40 if0<x<a, (21.1) 
co ifx>a, 
The parameter a is the width of the well. The energy is measured from 


the bottom of the well. Within the limits of the well (0 < x < a), the 
Schrédinger equation (20.13) is of the form 


ao 

aa ko =0 (21.2) 
where 

k? = 2mE/h? (21.3) 


At the boundaries of the well (for x = 0 and x = a) the continuous wave 
function g vanishes, since the infinitely high “walls” make it impossible 
for the particle to be found beyond the limits of the well.” Thus in this 


case the boundary conditions are of the form: 
p(0) = g(a) =0 (21.4) 
We can write the general solution of differential equation (21.2) as follows: 
g(x) =A sin(k x) + B cos(k x) (21.5) 
Since (0) = 0, it follows that B = 0. Thus, 
o(x) =A sin(k x) (21.6) 
From the condition g(a) = 0, we conclude that 


ka = mn (21.7) 


6de Ter Haar, editor (1965). Selected 
Problems in Quantum Mechanics. Aca- 
demic Press, N.Y; and Flügge, S. (1971). 
Practical Quantum Meehanics, Vols. 1 and 2. 
Springer, Berlin 


A Particle in a One- 
Dimensional Potential Well 


with Infinitely High “Walls” 


TIn the case of “walls” having a finite 
height, this possibility, as will be shown 
below, is not excluded. 
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where n is an integer. 


Taking (21.3) into account, the last result can be rewritten in the follow- 


ing form: 
2 2H? 
E, = n n 





21.8 
2ma? L 


The expression (21.8) determines the spectrum of values of the energy 
(energy levels) of the particle in the potential well. It coincides with the 


expression (5.2) derived earlier. 


According to (21.5) and (21.7), the wave function g,(x) corresponding 
to the nth energy level, is of the form ¢,(x) = Asin(znx/a). The integra- 
tion constant A is determined from normalization condition [see (15.13) in 


this a connection] 
a 


J èa: 


0 


2 
It can be easily seen that A = N and, consequently, 
a 


n(x) = Jan (=) (21.9) 


Thus, we have found the energy levels and the orthonormalized wave 
functions for the stationary states of a particle in a rectangular one dimen- 


sional well with infinitely high “walls”. 


Let us consider a rectangular potential well shown in Figure 21.1 (a). Rectangular Potential Well 
Since the particle is inside the well, E < U; and E < U}. The rectangularity with “Walls” of Finite Height 
of the potential enables us to clearly distinguish three spatial regions: 


region 1 (x < 0), region 2 (0 < x < a) and region 3: (x > a). 


We shall consider these regions separately and will then combine these 
results at the boundaries of the regions, i.e. at the points x = 0 and x = a. 


The Schrödinger equation (20.13) has the following form: 


d2 
for region 1: S — xtg =0, where x? = 2m(U, — E)/h? (21.10a) 
x 
: de 2 2 2 
for region 2: T +k“ =0, where kf = 2mE/h*, and (21.10b) 
. do 2 2 2 
for region 3: Fm x3 =0, where x; = 2m(U; — E) /ħ (21.10c) 
x 





The general solutions of these differential equations may be written in the 


following form: 
Figure 21.1: A particle with energy E ina 
rectangular potential well with “walls” of 


for region 1: @, = Ai exp( x x) + By expl- x) (21.11a) finite height, U; and Uz. 


for region 2: @2 =A exp(ik x) + Bz exp(-ik x), and (21.11b) 


for region 3: @3 = A3 exp(%2 x) + By exp(—%2 x) (21.11c) 
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[Note that the solution of equation (21.10b) may be written in the form 
(21.11b) or (21.5)] The boundedness of the wave function requires that Bı 
and A; be put equal to zero. Thus, 


Qı = Ay exp(% x) 
@2 = Ag exp(ik x) + By exp(-ik x) (21.12) 
p3 = Bs exp(—%2 x) 


The qualitative form of the functions 1, y3 and Re {@} is shown in Figure 
21.1 (b). The reader should pay attention to the fact that in the case of a 
potential well with “walls” of finite height, there always exists a probabil- 
ity of finding the particle beyond the limits of the well; this probability 


decreases exponentially with the distance from the boundaries of the well. 


In order to find the four coefficients (A1, A2, B2, B3), we make use of 
the fact that both the function and its first derivative must be continuous 
at the boundaries. The continuity of the wave function is obvious, while 
the continuity of the derivative can be easily shown. In order to do so, 
let us integrate Schrédinger’s equation (20.13) over a certain interval 
(a— A, a+ A) containing the potential jump. We get 


atA 
2m 


dọ dp 
att) - ue A) = Re J [U (x) — E]gdx 
a-A 
Since the functions under the integral sign are bounded, in the limit of 
A — 0 this integral vanishes. As a result we get 
dp 


dp, 
a, t= a 0) 


which was to be proved. Returning to our problem, we write the bound- 
ary conditions (conditions of ‘piecing’ together the solutions at the bound- 
aries of the region): 


1(0) = 92(0) 
p2(a) = 93(a) 


dg _ dez (21.13) 
zO = 7 ® 

doz dos 

T ® = a ® 


Substituting the expressions (21.12) in these equations, we get the system 


of equations for the coefficients (A1, A2, B2, B3): 


A, = A + Bo, 
A exp(ika) + B> exp(—ika) = B, exp(—%2 a), 
xı Ay = ik(A2 = B2) 


(21.14) 


ikA2 exp(ika) — ikB exp(—ika) = —x2 B3 exp(—2 a) 


The system (21.14) is a homogeneous system of linear equations. In order 
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that such a system should have non- trivial solutions, it is necessary that 
its determinant should be equal to zero. Equating the determinant of the 
system to zero, we get an equation for the energy E (we recall that the 
quantities k, x1, x are expressed in terms of E). The solutions of this 


equation will give us the possible values of the energy of the particle. 


We emphasize that from a mathematical point of view, the quantiza- 
tion of the energy in a well is a direct consequence of the homogeneity of 
the system (21.14). This is the crux of the problem of finding eigenvalues 
of physical quantities: the particle is left to its own devices, i.e. the exter- 
nal influences are eliminated thus eliminating inhomogeneities from the 
equation. As a result, the particle chooses its own course characterized 
by certain fundamental parameters (frequency, energy, etc.). The simplest 
example, taken from classical physics, is that of a pendulum. If a pendu- 
lum is not disturbed, it will oscillate with a definite natural frequency, 


irrespective of the way the oscillation have been induced. 


Thus, in order to determine the value of the energy of a particle, we 
should equate to zero the determinant of the system (21.14) and solve the 
equation so obtained. However, it is inconvenient in practice to consider 
a determinant of the 4th order. Hence we shall first simplify the system of 


equations. To do so, we rewrite the function @ in the form 
2 = Csin(kx + b) 


(this form is equivalent to the one used earlier; the reader may indepen- 
dently express the coefficients C and b in terms of the old coefficients Az 


and B2). The system (21.14) will now have the following form: 


A; =Csinb 
C sin(ka + b) = B; exp(—2a) 
(21.15) 
xıAı = kC cos b 
kC cos(ka + b) = —x2B3 exp(- xa) 
Dividing the third equation of this system by first and the fourth by sec- 
ond, we get the fourth by second, we get 


xı = kcotb, 


xz = —k cot(ka + b) 


(21.16) 


We now have a system of two equations in place of the system with four 
equations. From the first equation of the system (21.16) we find cotb = 
xı/k, hence 


sinb = (1 + cot? b) 1 = 
2mU, 
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Similarly, we get from the second equation of (21.16) 


kh 
Vv 2mU, 


As a result, we get an equation that determines the value of k (and, conse- 


sin(ka + b) = — 





quently, the energy levels) in implicit form: 


kh kh 


2m 





ka = nx — arcsin 





(21.17) 


S 
N 
3 
= 





where n is an integer. Figure 21.2 shows the left- and right-hand sides 


of equation (21.17) as a function of k. 


Figure 21.2: The number of energy levels 


of the particle are determined by the 
In the situation shown in the figure, the particle has three energy levels intersection of the straight line y = ka 


corresponding to values of k equal to ky, kz, k3. If we change the width of with values of k. The deeper the well, 


there will be more intersections leading to 


the well, the slope of the straight line y = ka will change thus changing 


more levels. 
the the position and the number of possible energy levels. 


A decrease in the width of the well will decrease the slope of the 
straight line. The energy levels will “creep” out of the well, and their 
number will gradually decrease. An increase in the width of the well will 
raise the line y = ka; it will intersect a larger number of branches of the 
inverse sine graph, thus leading to a larger number of levels in the well. 
As a — ov, the number of levels in the well will rise indefinitely, and 
we finally get a continuous energy spectrum. It is easy to examine simi- 
larly the effect of a change in the depth of tho well on the spectrum: the 


greater is the depth, the more levels there are in the well. 


We should proceed further in the following way: find from (21.17) the 
possible values of k and the values E, xı and x2 corresponding to them, 
then substitute these values in (21.15) and solve the system of equations 
containing the coefficients, and then substitute the final result in the 
expression (21.12) for the wave functions. However, on account of the 
mathematical complications involved, we shall not embark on this venture 


here. 


In conclusion, we note that (21.17) may also be used for determining 
the minimum energy E; of a particle in a potential well. For this we just 
have to consider the hatched triangle in Figure 21.2 and put kia ~ a, 
whence we get E1 ~ 17h? /2ma’. This result is in good agreement with 
the estimate (4.11) obtained in Section 4 on the basis of the uncertainty 


relations (3.3). 


When considering the motion of a particle in a spherically symmetri- A Particle in a Spherically 
cal field, it is convenient to use spherical coordinates r, 0 and g. Spherical Symmetrical Field 


symmetry of the field means that U (F) = U(r). The A-operator in spheri- 
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cal coordinates has the form 


10 fa) 1 
A= ——([r?—|]+—A 21.18 
r2 or | >) p20? ( ) 
where 
AE ce IE ANNE A oe (21.19) 
20 = n20 ap? sind a0 \” a0 i 


Taking this into account, we can rewrite the Schrödinger equation (20.14) 


in the form 


10 


42 (ež) P Zep Wr 6,0) + [E-U(r)] W(r,0,0) =0 (21.20) 


Equation (21.20) allows the separation of variables. This means that its 
solution may be found in the form of a product of two functions, one of 
which depends only on r, and the other on the angular coordinates 0 and 
Q: 

V(r, 0, 0) = R(r) ®(8, o) (21.21) 


Substituting (21.21) into (21.20), we get the following result: 





d ù „dR 22m 
T | a+r 72 EE HOIR- Mop ð 


R(r) ~ 0,9) 





(21.22) 


Since the left- and right-hand sides of (21.22) depend on different inde- 
pendent variables (on r and on @ and ọ, respectively), both sides must be 
equal to some constant, which we denote by A. Introducing this constant, 
we write 

—Aog (8,9) =A B(O, p) (21.23) 


Comparing (21.19) with (20.23), we conclude that (21.23) is in fact the 
equation for the eigenvalues and eigenfunctions of the operator M’. This 


allows us to use (20.39) and (20.40) and write 


A=I(1+1), 1=0, 1, 2, (21.24) 


E |21+1 (l-Im)! Îmi : _ 
(0, ¢) = mE (F impi! cos 0 exp(im@) = Yım(0, g) (21.25) 


where m = 0, +1,... + l. The functions Yım(0, ọ) are spherical harmonics 





(they were introduced in Section 20). It will he useful to write down the 


expressions for the first few spherical harmonics: 


Yoo = ———— (21.26a) 


3 3 
Yio = ap cos@, Yia1=4/ = sin 6 exp(+i) (21.26b) 
T 


2 
m5 
Yz +1 = 4/ — sin 0 cos 0 exp(+iọ) 
8m 
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15 
Yo, 42 = 4/ 35 sin? 0 exp(+2ig) (21.26c) 


We emphasize that the “angular part” of the wave function is independent 
of the particular form of the potential U(r); this is a direct and important 


consequence of the spherical symmetry of the potential. 


We now turn to the “radial part” of the wave function, i.e. to the func- 
tion R(r). According to (21.22) and (21.24), this must be a solution of the 
equation 


[E -Uj(r)] R(r) =0 (21.27) 





1 d {dR _ 2m 
r 

r2 dr dr h2 
where we have introduced the notation 


f2l(1 + 1) 


Ui(r) =U) + 


(21.28) 


It should be noted that (21.27) may be reduced to the one-dimensional 
Schrédinger equation with a special boundary condition at r = 0. For this 


we must use the substitution 
g(r) =rR(r) (21.29) 


and in view of the boundedness of the function R(r), require that the 
condition ¢ (0) = 0 be satisfied. It can be easily seen that the substitution 
(21.29) in fact converts (21.27) into the one-dimensional Schrédinger 
equation i 

e + = [E-U,(r)] e(r) =0 (21.30) 
In this case the boundary condition (0) = 0 corresponds to the one- 
dimensional potential well having an infinitely high vertical wall on the 
left (at r = 0). 


Further study of (21.30) obviously requires a consideration of the par- 


ticular form of the potential U(r). 


It should be noted that among the problems on the motion of a particle 
in a spherically symmetrical field are the problems of an electron in an 


atom and the scattering of particles by spherically symmetrical centres. 


We shall establish a formal analogy between the time-dependent 
Schrédinger equation and the continuity equation, which is widely used 
in classical physics, especially in hydrodynamics. We assume that there is 
a certain medium (for example, a liquid) described by the functions p(7) 
and d (F). [p(F) is the density of the medium and @ (7) is the velocity of 
the particles of the medium at the point 7; naturally, these functions may 
also depend on time]. Let us imagine a certain volume V in the medium 


to be isolated. The change in the quantity of liquid in this volume per unit 


The Continuity Equation and 
the Schrödinger Equation 
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time is equal to 


|% 


dV 
if? 


V 


& 


Let us isolate a certain element of surface area dS on the area S bounding 
the volume V, and associate with it a vector dS equal to dS in magnitude 
and directed along the outward normal to the surface. The amount of 
liquid passing per unit time from the volume V through an element of 
surface area dS is equal to pd dS. The amount of liquid passing through 


the entire surface per unit time is equal to 


$ pads 


S 


The law of conservation of matter requires 


at 
V S 
Thus, 
ð 
2 f paved padi - 
V S 


By replacing the integral over the closed surface by a volume integral, we 


can rewrite the last equation in the following form: 


ð 5 

J (Zravi}av=o (21.31) 
ot 

V 


where j = pi is the vector of the density of flow of the liquid. 


Equation (21.31) is independent of the choice of the volume V. Using 
this fact, we can decrease the volume V to some particular point. In the 


limit as V — 0, (21.31) becomes a differential equation at this point: 
op > 
— +divj=0 21.32 
F j (21.32) 
This is the classical continuity equation. 


Let us turn to the time-dependent Schrödinger equation. We shall 
generalize the one-dimensional equation (20.17) for the three-dimensional 


case: 
ə h? 
in—W(F, t) = -— AY (F, t)+ U(r) ¥(F, t) (21.33) 
ot 2m 


We introduce purely formally a certain “medium” and define the density 
of this “medium” as ¥¥*. This density may be called the “probability den- 
sity”. In other words, the probability of finding a particle will be greater at 


those points of space, where the density of the “medium” is higher. 


This “probability density” may be given a fairly simple physical mean- 


ing if we imagine that the space is filled with a large number of particles 
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(the interaction of particles is to be neglected in this case). Obviously, the 
number of particles in some volume AV is proportional to the probabil- 
ity of finding a particle in this volume. With this approach, ¥¥* may be 


considered simply as the density of the number of particles. 


As in the classical case, we start by considering a certain finite volume 


ð d¥ d¥* 
2 pav=2 [wav f wr — 4+ 9 dV 
ot ot dt dt 

V V V 





* 





d 
Substituting into this JE and from the Schrödinger equation (21.33), 


dt 
and from the complex conjugate of equation (21.33), we get 


a ih 
2 [wear f wrav- yaw av 
ot 2m 

V V 


ih 
ee fa (F*VY — YVY*) dV 
V 


~ 2m 


The last result can be written in the form 
igh ee ae iñ y + 
—(¥¥*) - div | —Y*VY — YVY¥* | dV =0 (21.34) 
ot 2m 
or after decreasing the volume V to a point, 
ð 5 ; ih _., , 
—(¥Y") - div | — VV - PVP" =0 (21.35) 
ot 2m 


By drawing an analogy between (21.34) and (21.31) [or between (21.35) 
and (21.32)], we come to the conclusion that the Schrödinger equation 
corresponds to some quantum-mechanical continuity equation, if in 


addition to the probability density® 
p=VV= 0% (21.36) 
we also introduce the vector of probability flow density 


> ih tea 

j=— (YV - VV) 
p (21.37) 

J (PVo* - ¢*Vo) 
m 


If we interpret (21.36) as the density of particles, then the vector (21.37) 
may be considered as the vector of density of the flow of particles. With 
such an interpretation the quantum-mechanical continuity equation 


(21.35) expresses the law of conservation of the number of particles. 


In the case of a one-dimensional motion along the x-axis, (21.37) as- 


sumes the form 





j= (21.38) 


ih | dọ* „dọ 
2m ? dx Y dx 


In conclusion, let us note that we cannot assign the meaning of “flow” in 


8The functions ¥(F, t) and g(r) are 
related to each other through a relation of 
the type (20.16). 
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the literal sense to the quantum-mechanical vector J, since in order to 
determine the flow through any surface we must be able to measure the 
values of the velocity (momentum) at fixed points of the surface, which is 


obviously in contradiction to the uncertainty relations. 


Let us consider a one-dimensional rectangular potential barrier (Fig- 
ure 21.3) and assume that particles arrive at it from the left with an energy 
E which is less than the height U of the barrier. We can isolate three spa- 
tial regions and write the solutions of the Schrödinger equation (20.13) for 


these regions: 





2mE 
Q1(x) = A; exp(ikx) + By exp(-ikx); k= 7 
2m(U — E 
@2(x) = Az exp( xx) + Bz exp(—xx); x= E (21.39) 


93(x) = A; exp(ikx) + B3 exp(—ikx) 


The terms containing exp(ikx) describe the particles moving in the 
positive direction of the x-axis while the terms containing exp(—ikx) 
indicate motion in the opposite direction. If we take into account that 
the particles are moving in the positive direction, we must exclude the 
second term in the function g3 : B3 = 0. The other coefficients are non- 
zero. The term with A, describes particles falling on the barrier, the term 
with B, describes those reflected from the barrier, while the term with A3 


describes particles which have passed through the barrier. 


The conditions of continuity of the wave function and its derivative at 


points x = 0 and x = a give the following system of four equations: 


A, + By = A + Bo 
Az exp( xa) + Bz exp(—xa) = A; exp(ika) 
(21.40) 
ik(A, — By) = (Az — B2) 


%[A2 exp(xa) — Bz exp(—xa)] = ik A; exp(ika) 


It turns out that we have just four equations for five coefficients! But 
actually, only four and not five coefficients are known. The density of 
flow of particles incident on the barrier (jinc) must be given. This density 
is given by (21.38), where we must substitute ọ = A; exp(ikx). As a result 
of this, we get 

Jine = |A1|"Ak/m (21.41) 


Thus by knowing the quantity jin: we can determine the coefficient Aj. 


Similarly for the density of flow of the reflected particles we get 
jef = |By|?hk/m (21.42) 


and for the density of flow of particles passing through the barrier we 


Passage of a Particle Under or 


Over a Potential Barrier 





Figure 21.3: Particle with energy E passing 


under the potential barrier with energy U. 
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have 


Ja = |43| ħk/m (21.43) 


Usually in such problems the density jinc is chosen in such a way that 


A, = 1. In this case the system (21.40) assumes the form 


1+ Bı = A + B2 
A, exp( xa) + B, exp(—xa) = A; exp(ika) 
(21.44) 
k(1 — By) = x (A2 — Bz) 


x [Az exp(xa) — Bz exp(—xa)] = ik A3 exp(ika) 


The system (21.44) is the non-homogeneous system of four linear equa- 


tions containing four unknown coefficients. 


The inhomogeneous system has a solution for any values of k and x, 
i.e. for any values of the energy E of the particle. This is in agreement 
with the fact that for infinite motion of the particle its energy is not quan- 


tized. 


We can determine the fraction of particles that has passed through the 
barrier: 
Je 


Jinc 


D= (21.45) 


The quantity D is called the transmission coefficient of the barrier. Solving 


the system (21.44) (we shall omit the steps), we get 


4ik 
A3 = slat exp(—ika) 
x 





ik\? ik\? 
exp( xa) (1 - x) — exp(—xa) f + *) (21.46) 


Further, using (21.43) and (21.45), we find 


4k? x? 
D= — (21.47) 
4k? x? + (k? + x?)? sinh“ (xa) 





In the particular case when xa > 1, (21.47) is simplified as 


D = Do exp SANTAT — E) (21.48) 
h 


where 


Desig 
er ie U 


In addition to the transmission coefficient, we also have the coefficient of 
reflection at the barrier, defined as the fraction of the particles reflected by 
the barrier: R = jret/jinc. It is clear from basic principles that D + R = 1 (all 
the particles not passing through the barrier must be reflected by it). 


Finally we consider the case when a particle passes over the barrier 
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(E > U). In this case, instead of (21.39) we get 


g(x) = exp(ikx) + Bı exp(—ikx) 
(x) = Ag exp(iKx) + By exp(—iKx) (21.49) 
(x) = A; exp(ikx), where K = y2m(E —U)/h 


Using (21.49), we write the boundary conditions for the points x = 0 
and x = a. We then solve the system of equations so obtained and find 
the coefficient B, Further, from (21.42) we determine the coefficient of D 


reflection R = jref/ jine. It turns out to he equal to 1 


(k? — K*)? sin? (Ka) 


eee eee ee 21.50 
4k2K?2 + (k2 — K2)? sin? (Ka) ( ) 





Using (21.47) and (21.50) we can find the dependence of the transmis- 
sion coefficient D on the ratio E/U. This dependence is shown graphically JE E & 
in Figure 21.4. The same figure shows the dependence D (E/U) for a Figure 21.4: Classical and quantum 
classical particle (dotted line). A comparison of the solid curve with the transmission and reflection coefficients. 
dotted line indicates the quantum-mechanical nature of microparticles. 

Note that for E < U in classical mechanics all particles are reflected from 

the barrier and not a single particle passes through. In quantum mechan- 

ics, however, a part of the particles is reflected and a part passes through 

the barrier. For E > U, in classical mechanics all particles pass through 

and not a single particle is reflected, while in quantum mechanics a part 

of the particles passes through, and another is reflected. Both sub-barrier 

transmission and above-barrier reflection of microparticles are specifically 


quantum effects. 


22 The Hamiltonian in Some Specific Problems 


The Hamiltonian is of the form [See Appendices E and F]. Linear Harmonic Oscillator 
` K d mox 
H=-—— + —— 22.1 
2m dx? 2 vAN 
It is obtained from (4.5) by taking (19.1) and (20.4) into account. The 
eigenvalues are 
1 
En = ho | + n) n = 0,1,2, (22.2) 


[for n = 0 we get from (22.2) the energy of zero-point oscillations, which 
was determined in Section 4 on the basis of the uncertainty relations]. 


The eigenfunctions are 


n(x) = Ymo /ħ exp(-€?/2) Ha (£) (22.3) 
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where ë = xymo/h, and H,,(€) are called Hermite polynomials. Let us 


write down the expressions for the first few functions g(x): 


go(x) = (xo Vr)? exp(—x?/2x9) (22.4a) 

g(x) = (2x0 Vm) exp(—x?/2x?) 2x/xo (22.4b) 

p(x) = (8xoVx) 1? exp(—x?/2x3) (5 — | (22.4c) 
0 


where xo = Vhi/mw. 


Note. Hermite polynomials, generalized Laguerre polynomials (men- 
tioned below), harmonic functions and Legendre polynomials (introduced 
in Section 20) are called special functions. A considerable amount of math- 
ematical literature has been devoted to special functions and their appli- 
cations; for reference we mention books by Smirnov and Lebedev.” A 
basic account of polynomials and associated Legendre functions, special 


functions and Hermitian polynomials is given in Appendix E. 


The problem of the hydrogen atom is a well-known example of the 
motion of an electron in a spherically symmetrical Coulomb field. The 


Hamiltonian has the form 
H=-—A-— (22.5) 


[it is obtained from (4.1) by using (19.1) and (20.6)]. The eigenvalues of 
this Hamiltonian are given by the following familiar expression [see (2.5a)- 


(2.5b)]: 
met 
2h? n2’ 


The eigenfunctions of the Hamiltonian (22.5) may be expressed in the 


En = n= 1,2,3. (22.6) 


form 
Ynim = Rar) Yin (@, ¢) 1=0,1,2,...n-1 m=0,+1,...41 (22.7) 


Here Yım(0, o) are spherical functions. They define the “angular part” of 
the wave function irrespective of the particular form of the spherically 
symmetrical potential; R„;(r) is the “radial part” of the wave function, it is 
defined by (21.30) with the Coulomb potential U(r) = —e*/r. The form of 


the function R,;(r) is described by the expression 


I 
2 2 
Rni(r) = const exp (=) (=) p (z) (22.8) 
rın rın rın 
where r, = h? /me (this quantity is already known to the reader as the 
radius of Bohr’s first orbit) and L2!*! are the so-called generalized Laguerre 


n+1 
functions (see, for example, Smirnov?) 


Expressions for some of the first harmonic functions have been given 


9 smirnov, V. I. (1954). A Course of H 
igher Mathematics, Vol. 3, Part 2. Pergamon 
Press, Oxford; and Lebedev, N. N. (1965). 
Special Functions and Their Applications. 
Prentice-Hall, N.Y 


The Hydrogen Atom 


10¢mirnov, V. I. (1954). A Course of H 
igher Mathematics, Vol. 3, Part 2. Pergamon 
Press, Oxford 
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in Section 21 [see (21.26). We shall now give the expressions for the first 


few functions R,)(r): 


Rio = 2r” exp(—r/r1) (22.9a) 
Roo = (2r3)1/? exp(—-r/2r1) (1 - =) (22.9b) 
2r1 
-1 
Ra = (eri) exp(=r/2r1)(r/r1) (22.90) 


In Section 5, when discussing the concept of an electron cloud: we intro- 
duced the functions vp; and Zım. Figure 5.2 (a) showed the form of some 
of the functions wy)(r) = r*vn)(r), while Figure 5.2 (b) showed some of the 
functions Zim. Returning to the functions considered in this section, we 
note that v,; = RÈ (r) and Zim = |Yım(0, ọ)|?. In particular, we note that 
(5.4) is in agreement with (22.9a). 


By using (22.9) and (21.26), we can write the first few eigenfunctions of 
the Hamiltonian (22.5): 


Yioo = (arf)? exp(-r/r) (22.10a) 
Ya = (873)? exp(-r/2r1) (1 = Z) (22.10b) 
Yan = (8yf2r3) exp(—r/2r) sin 8 exp(ip)r/n (22.10¢) 
Wo10 = (44/2ar3)~* exp(—r/2r,) cos Or/r; (22.10d) 


The function (22.10a) describes the ground state of the hydrogen atom, 
while the functions (22.10b)-(22.10d) describe the excited states corre- 
sponding to the first excited energy level (n = 2). 


It follows from (22.6) that the energy of an electron in the hydrogen 
atom is determined only by the quantum number n, while the states (the 
functions Ynım) are determined by three quantum numbers n, l and m. 
Besides, when considering electronic states we must take into account 
the quantum number o which does not occur in these expressions. Since 
for a given value of the principal quantum number n the orbital quantum 
number l assumes integral values from 0 to n — 1, and for every l the 
magnetic quantum number acquires 2/ + 1 values, the following gn states 


must correspond to an energy level E,,: 
n-1 
gn = 2) (al +1) = 2n? (22.11) 
1=0 


(the factor 2 takes account of the two spin states of the electron). This 
means that the eigenvalue E, of the Hamiltonian (22.5) (in other words, 


the nth energy level) is 2n’-fold degenerate. 


Degeneracy of energy levels, as a rule, is associated with symmetry in 


On Degeneracy of Energy 


Levels 
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the atomic system. Thus, for example, owing to the spherical symmetry 
of intra-atomic fields, there is degeneracy of the quantum numbers m and 
o — the energy is independent of the orientation of the orbital momentum 
and the spin momentum of the electron. The degeneracy of the quantum 
number l is associated with the specific nature of the Coulomb potential; 
in non-Coulomb fields the energy of the electron depends not only on n 


but also on I. 


Various fields, external as well as internal, may lower the degree of 
symmetry of the system. Thus, for example, “switching on” an external 
electric field leads to the appearance of a physically distinguished direc- 
tion. As a result, the spherical symmetry disappears and is replaced by 
cylindrical symmetry. A decrease in symmetry leads to a removal of de- 
generacy (partial or total). This is exhibited in a splitting of energy levels, 
i.e. in their conversion into sets of new, less degenerate levels. Splitting of 
energy levels in an external electric field is known as Stark’s effect, while 


the splitting in an external magnetic field is known as Zeeman’s effect. 


The Hamiltonian of a crystal consisting of N nuclei and ZN electrons Crystal; the Adiabatic Approxi- 
can be written in the form mation 
N ZN 
A= P+ Y RUUAN + UR) +UA) (22.12) 
2M S404 2m4 k aes 


where M is the mass of the nucleus, P; is the momentum operator for 
the ith nucleus, m is the mass of the electron, pp is the momentum oper- 
ator for the kth electron, {rk} is the set of coordinates of the electrons, 
{Ri} is the set of coordinates of the nuclei. The function U; describes the 


interaction of electrons. It is of the form 


z 1 e? 
U=5 2 » = (22.13) 


Fkl 


where ry is the distance between the kth and lth electrons. The function 


U; describes the mutual interaction of the nuclei, while the function U3 


11 11The functions U1, U2, U3 describing 


the various interaction potentials, in fact 
correspond to operators in the coordinate 
representation. 


describes the interaction of the nuclei with the electrons. 


Since M > m, the nuclei move much more slowly than the electrons. 
This permits us to consider the motions of nuclei and electrons sepa- 
rately: when considering the motion of electrons, we assume that the 
nuclei are stationary, while when considering the motion of the nuclei, 
we assume that the electrons collectively create an average field which 
is independent of the coordinate of individual electrons. In this case the 
wave function of the crystal may be represented in the form of a product 


of “nuclear” and “electronic” functions: 


PFD {RH = YURD peA) (22.14) 
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We also represent the Hamiltonian (22.12) as a sum of the “nuclear” Hamil- 


tonian H; and the “electronic” Hamiltonian Ho: 


N 


3 1 ` X 
H, = T P? + U2({Ri}) (22.15) 
> 1 e? = 

Ĥ = mAT Dag D (22.16) 


The function U; describes the potential energy of the electrons in the 


field of the nuclei which are located at the lattice sites in the crystal. 


Thus, instead of solving an extremely complicated Schrödinger equa- 


tion for the entire crystal 
AYAR), (RH = EYA} (RD) (22.17) 


it is enough to solve two much simpler equations: 


(a) for nuclei (for crystal lattice) 


AYAR} = BYR} (22.18) 
(b) for electrons 
Pope Fk} = E perk} (22.19) 


where E* = E'*' + E°, This approximation is called adiabatic. 


In conclusion, we make a fairly important clarification. When using 

the adiabatic approximation we consider, strictly speaking, not the bare 
nuclei, but nuclei together with those electrons which are tightly bound 
to them. Consequently, when we speak about a separate treatment of the 
electrons ensemble, we mean not all the electrons, but only those which 
have been “collectivized” by the crystal (in other words, the electrons 


moving over the crystal lattice, for example, the conduction electrons). 


In accordance with the adiabatic approximation, we shall consider, 
without going into the dynamics of the crystal lattice, the motion of elec- 
trons which have been “collectivized” by the crystal. We use the expres- 
sion (22.16) and consider that the function U3({r;.}) may be represented 
as a summation over the “collectivized” electrons (since each electron 


interacts with the lattice field independently from other electrons): 
URH = X, UF) 
k 


In this case the “electronic” Hamiltonian (22.16) assumes the form 


Hea At gt DUA (22.20) 


k4l 


One-Electron Approximation 
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Further simplification is based on the assumption that the term 
1 e? 
2o 
in (22.20) may be approximated by a summation over the electrons: 


1 e? = 
; >; 8 X, Us(F) (22.21) 
k4l k 

In other words, when considering electron-electron interaction, it is 
assumed that each electron moves in a certain field which is common 

for the whole ensemble (it is called the self-consistent field). As a result, 
the Hamiltonian of the electron ensemble can be represented as a sum of 
“one-electron” Hamiltonian. This allows us to represent the wave function 
of the ensemble in the form of a product of “one-electron” functions [we 
denote them by 9 (7,)], after which the Schrödinger equation for the 


electron ensemble turns into set of “one-electron” equations of the form 


3 


2 


2 +U) +Us(¥)| oF) =E) (22.22) 


Here p and F are the momentum operator and the coordinate of one of the 


“collectivized” electrons, E being the energy of the electron. 


Thus, by using (22.21) we can go over from a consideration of the 


electron ensemble to consideration of a single electron moving in the field: 


U(r) = U(f) + Us (F) (22.23) 


This transition is called the one-electron approximation. 


The potential U (F) is a periodic function with the period of the crystal 
lattice. It will be shown in Section 24 that the energy of an electron mov- 
ing in a periodic field is broken up into alternate bands of allowed and 
forbidden values, i.e. has a band structure. An electron hound to an atom 
has energy levels, while a free electron is characterized by a continuous 
energy spectrum. An electron “collectivized” by the crystal occupies an 
“intermediate” position to a certain extent — it is “free”, but only within 
the limits of the crystal. The band structure of the energy states of such 
an electron is obvious and is “intermediate” between the structure of 


discrete levels and that of a continuous spectrum. 


The relative freedom of movement of a “collectivized” electron is re- 
flected, in particular, in its wave function, which is represented in the 


form of what are called Bloch functions: 


(r) = u(F) exp(ipr/n) (22.24) 
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This is the wave function (15.15) of a free electron, modulated by the 
function u(r), which has the period of the potential U (F) (for more about 


Bloch functions, see Section 24). 


We shall consider the system of a bound electron plus radiation. In the 
absence of interaction between the electron and the radiation, the system 
is described by the “unperturbed” Hamiltonian: 


A2 


A eed, 


22.25 
ma (22.25) 


A2 

where P + U is the Hamiltonian of the electron, and H; is the Hamilto- 
m 

nian of the radiation. In the case of interaction between the electron and 


the radiation, the system is described by a “perturbed” Hamiltonian: 


3 e»)? 
a 
H= £ (22.26) 
2m+U +H, 


where A is the operator of the vector potential of the radiation field [we 


recall that in the coordinate representation A(?) = A(?).12 


Note that the field potentials here have been chosen in such a way that 
the well-known calibration conditions div A = 0 and y = 0 (ọ is the scalar 
potential of the field) are satisfied. Next, we represent the Hamiltonian in 
the following form: 


Ê = Ĥ + Ê’ (22.27) 


where H’ is the interaction Hamiltonian, which plays the role of the 


perturbation. Comparing (22.25), (22.26) and (22.27), we find that 


e? 





Was. |PA+ (4p)| + A (22.28) 
2mc 2mc? 


This expression can be somewhat simplified if we put, in accordance with 
(20.27), 

PAC) -APB = -ih div A P) 
Using the fact that div A = 0, we get 


2 

A e a => e 
H’ = -——(pA)+ A 22.29 
aa te ) 2mc?2 ( ) 





It should be noted here that the Hamiltonian (22.29) is responsible for all 
processes of absorption and emission (spontaneous as well as induced) of 


photons by an electron. 


23 Transition to the Momentum Representation 


We shall show how to go over from the coordinate to the momentum 
representation and shall give some results in the momentum representa- 


tion. 


The Hamiltonian of the In- 
teraction of an Electron with 


Electromagnetic Radiation 


121 is shown in classical field theory 
[for example, see Landau and Lifshitz, The 
Classical Theory of Fields ] that the inter- 
action of a charge with an electromagnetic 
field may be considered by replacing p by 
P = <A, We use this classical result here, 
replacing the dynamic variables with the 
corresponding operators. 

Landau, L. D. and Lifshitz, E. M. (1975). 
The Classical Theory of Fields (4th edition). 


Pergamon Press, Oxford 
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Obviously, the momentum operator in the momentum representation Momentum and Coordinate 
is the momentum itself: Operators in the Momentum 
: me es Representation 
px=px p=pPp. (23.1) 


We shall, therefore, go over directly to a consideration of the x-coordinate. 


Suppose that the amplitude of a state is given in the coordinate repre- 
sentation by the function ¢ (x), and in the momentum representation by 


the function ®(p,) By using (17.33), we can write 


J Ewrop / DP) t (px) O(pu) dpe (3.2) 


The relation between the functions ¢ (x) and ®(px), according to (15.6), is 
of the form 


0s) = f OPa) Vo dps (23.3) 
where wp, (x) are the eigenfunctions of the operator px in the coordinate 
representation. By using (20.9) we can rewrite the expression (23.3) in the 
following form: 


p(x) = © (px) exp(ipxx/h) dpx (23.4) 


va | 


By substituting (23.4) in the left-hand side of the equality (23.2), we get 


* _ 1 * , st - , 
i, o (x) x dx = —— [If (pL) exp(-ipex/h) x (px) exp(ipsx/h) dps dps dx 
(23.5) 
The factor x ®(p,.) exp(ip,x/h) under the integral sign may be expressed 


in the following form: 


d 
x O(p,) exp(ip,x/h) = the [® exp(ip,x/nh)] 
Px (23.6) 
ge [® exp(ip,x/n)] 
+i TA plipx 


We substitute (23.6) into (23.5) and consider the integral with respect to px. 


In so doing, we take into account that 


-ih | = [® exp(ipxx/f)| dpx = -ih (px) exp(ipex/f)|%, = 0 


x 
—00 


[since it is physically impossible to attain an infinitely large momentum, 

we get (co) = 0 and ®(—o) = 0]. Thus in the integral with respect to px, 
d® 

only the second term on the right-hand side of (23.6), erm exp(ipxx/h) 


should remain. As a result, (23.5) assumes the form 


1 dolp), 
J oxo ae = iz SSS Coin explitos -pdr SP? api dps dx 


(23.7) 


Integration with respect to x on the right-hand side of this equation gives, 
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according to (15.17), 


1 . , = — p’ 
aan | oP [i(px — p,)x/h| dx = (px — px) 


Further, using the property of the delta function, we perform an integra- 


tion with respect to p}: 


| D (P1) 5(pe— pL) dp! = (py) 


As a result, we are left with only the integral with respect to pyx and (23.7) 


assumes the form 


d 
J T E J Dp) ihi prds (238) 


Comparing the right-hand sides of (23.8) and (23.2), we find the expres- 


sion for the x-coordinate operator in the momentum representation: 


£ (px) = ine (23.9) 


A generalization to the three-dimensional case gives 
f (P) = ihV; (23.10) 
where V; is the gradient in the momentum space. 


By using(23.1) and (23.9) it is easy to see that the commutators of 
the operators of the coordinate and momentum components will be ex- 
actly the same in the momentum representation, as in the coordinate 
representation [we are speaking of the expressions (20.24)-(20.26)]. This 
conclusion may also be extended to the expressions (20.27)-(20.30). In 
other words, the commutation relations are independent of the choice of 
a representation, i.e. are unitary invariants. This is quite natural, if we re- 
call that the mathematical fact of commutation of operators has a definite 
physical meaning which, obviously, cannot change while going over from 


one representation to another. 


Going over to the momentum representation, we can write (20.11) in 


the form 


H (px) te(px) = E te (px) (23.11) 


where Tz (px) are the eigenfunctions of the Hamiltonian in the momen- 
tum representation. Note that the quantities E in (23.11) are exactly the 
same as in (20.11), since the spectrum of eigenvalues of a Hermitian opera- 


tor is a unitary invariant. Since in the momentum representation py = px 


and x = ii——., the Hamiltonian (20.12) will now have the form 
x 


2 
H(px) = a +U (až) (23.12) 


Unitary Invariance of the 


Commutation Relations 


Schrödinger Equation in the 


Momentum Representation 
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As a result, we get the following equation in place of (20.13): 


Px ate) | 
(2 z) TE(px) +U (a=) =0 (23.13) 


This is the time-independent Schrödinger equation in the momentum repre- 


sentation. 


As an example, let us write out the Hamiltonian of a linear harmonic 


oscillator: 3 ap ge 
A %2 mo 

Alps) = - 

m 2 dpx 





(23.14) 


Compare this expression with (22.1), which describes the same Hamilto- 


nian in the coordinate representation. 


The momentum representation allows us to obtain fairly easily one 
result with which the reader is already familiar. If a microparticle moves 


freely, equation (23.13) is obviously simplified: 


p 
(2 - z) TE(Ppx)=0 (23.15) 
2m 
from which it immediately follows that 


E= X (23.16) 
The result (23.16) has been already mentioned in Section 1. It means that 
a freely moving particle simultaneously possesses a definite energy and a 
definite momentum; moreover, these quantities are related to each other 
by the classical relation (23.16). In the case of a freely moving micro- 
particle, the stationary state is also an eigenfunction of the momentum 


operator. We emphasize that this can in no way be extended to bound 


micro-particles (see the following example). 
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In Section 21 we considered the problem of the motion of a particle in Probability of Momentum 
a one-dimensional rectangular potential well with infinitely high walls in Values for a Particle in a Rect- 
the coordinate representation. The energy levels (21.8) and the orthonor- angular Well with Infinitely 
malized amplitudes of stationary states (21.9) were determined. High Walls 


When going over to the momentum representation, the result (21.8) 
obviously does not change, while the result (21.9) changes. By finding 
the amplitude of stationary states in the momentum representation, we 
can also find the probability of values of momentum of a particle in the 
n-th energy state. We denote these amplitudes by T (px); the required 
probability will then be |t,(px)|?. The amplitude tn (px) are related to the 
amplitude of stationary states in the coordinate representation [to the 


amplitudes 9,(x)] by superposition relations of the same type as (23.3): 


Boas J onl Px)V.(Px) dx (23.17) 
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where y} (px) are the eigenfunctions of the operator £ in the momentum 
representation. By using the fact that y4 (px) = Vox (x), and (20.9), we can 


rewrite (23.17) in the following form: 


Tn (Px) = n(x) exp(—ipxx/h) dx (23.18) 


a 
vem 
2h A 
By substituting (21.9) into this equation and integration, we finally come 


to the following expressions for the probability: 
2 cos? (=5), n is odd 
2h 


—— a (23.19) 
2 x . 
h (er = ne) sin’ (=) , nis even 


It (px)I? = 


h2 


Thus, it has been rigorously shown that the stationary states (energy 
levels) of a particle in the potential well are not characterized by a definite 
momentum but by a corresponding definite de Broglie wavelength. We 
remind the reader that this circumstance was qualitatively discussed in 
Section 5 when we solved futility of a graphical representation of a bound 


microparticle in the form of a classical wave in a resonator. 


Summing up, we can compile a “scheme” for the transition from one A Scheme for the Transition 


representation to another as follows: from the Coordinate to the 


Momentum Representation 


Figure 23.1: A “scheme” for the transition 
from the coordinate to the momentum 
representation. 





This “scheme” assumes two methods of transition. The first method: 


the Schrédinger equation with the Hamiltonian H(x) is solved and the 
amplitudes of stationary states g,(x) are found in the coordinate repre- 
sentation. Then with the help of the superposition relation (23.17) we 
perform a transition from the amplitudes p(x) to the amplitudes tn (px) 
This is the method that was adopted in the above example. However, it is 
also possible to follow a second method: the transition from H(x) to H(px) 
is made and the Schrédinger equation in the momentum representation 
(23.13) is solved. In this case, the derivation of the amplitudes Tp (px) is 
reduced to the solution of the equation (23.13). 
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24 An Electron in a Periodic Field 


The quantum-mechanical problem of an electron in a periodic field 
plays an important role in the solid-state theory. We turn to this problem, 


using the one-electron approximation discussed in Section Section 22. 


Let us consider a one-dimensional periodic potential U(x) satisfying 
the condition 


U(x +a) = U(x) (24.1) 


Following the second method in the scheme given at the end of the pre- 
ceding section, we change over to the momentum representation. This 

means that the potential U(x) should be expressed as an operator in the 
momentum representation, U(p,). (In order to simplify the notation we 


shall write p for px here.) 


We expand the periodic function (24.1) in a Fourier series: 


U(x) = 3 ees (-="| 





a 


n=—oco 


and, changing to the momentum representation, we write 


a = 2anh d 
U(p) = >, U, exp | - $) (24.2) 





d 
We shall now show that the operator exp (> z) is a displacement opera- 


tor with a finite displacement in p-space by the amount p = pı. This is so, 











as 
dt ee ie 
T(p+ pi) =t(p) + P (p) + 5 Pi dp? +... 
S d 1, @ 
= Pp a m t(p) 
Pz 
= ex — 
p |P1 dp 
Thus j 
exp (p =| = 1 (p+ pi) (24.3) 
From (24.2) and (24.3) it follows that 
a = 2mh 
O(p) r(p) = > Unt (p+ a (24.4) 
n=—oo a 


By using (24.4) we can write the Schrödinger equation (23.13) in the 


following form: 


(= -z) t(p)+ > Unt (P+ arrn) = (24.5) 


2m a 


n=—oo 


In fact, (24.5) is a homogeneous system of linear equations containing the 


The Band Structure of the 
Energy Spectrum, Brillouin 


Zones 
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aah aah 
functions t(p), T [e = =) ,T l + “at etc. Generally speaking, this 
a a 


system consists of an infinite number of equations: 





2nh\* 
a ae 2nh 2mh(n + 1) 
Se en r(p+ om) + Ditae(P+ =m = 
2 co 
Z -z r(p)+ Y, Unt [+ | =0 (24.6) 


(corn z|r(p- 2) + YU [os + Pea D) i 


Non-zero solutions of this homogeneous system are possible only if its 
determinant is equal to zero. We denote this determinant by D(E, p) and 
symbolically write 

D(E,p)=0 (24.7) 


We fix p (let, say, p = pı) and write the roots of equation (24.7) as 

E: (pi), E2 (pi), Es (pı) ... For a different value of p (say p = p2) we 

get new roots: E; (p2), E2 (p2), Es (pz)... By choosing different values of 
p, we finally get a set of functions defined by equation (24.7): 


E, (p), Ez (p), Es (p)... Ej (p) (24.8) 


For every index j, the energy is a continuous function of the momentum. 


By assuming that these functions are bounded we write for the index j 
Et < Ej (p) < Er (24.9) 


The inequalities (24.9) include the energy values for the microparticle 
which constitute the jth energy band. If E7 < prn we get a region of 
unattainable energy values between the (j — 1)th and jth energy bands. 
This region is usually called the forbidden band. 


Thus, the energy spectrum of an electron in a periodic field must con- 
sist of a number of energy bands, some of which may be separated by 
forbidden bands. Within every energy band the energy varies continu- 


ously it is described by some continuous function E; (p). 


By replacing p by p+ 2zfin/a, the system (24.6) transforms into itself. It 


follows hence that 
2nhn 
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Since the above replacement does not change anything, we can say that 
the momentum p has physically different values within the limits of the 
band: 
h 
-r- < p< n- (24.11) 
a 


In other words, the p-space is split into intervals of length anh and one 
has to consider p only within the limits of one individual Ea, These 
intervals are called the Brillouin zones. In this case we are dealing with 
one-dimensional Brillouin zones. In general, the Brillouin zones are three- 
dimensional; they often have a very complex configuration, which reflects 


the specific nature of the periodic field under consideration. 


The band structure of the energy spectrum is characteristic of an 
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electron moving in the periodic field of a crystal lattice. The concepts of 13Ziman, J. M. (1972). Principles of the 


energy bands and Brillouin zones form the basis of the modern electronic 


14) University Press, London 


theory of solids (see, for example, Ziman!’ and Jones 


E 


Crystals. North-Holland, Amsterdam 





Figure 24.1: Brillouin zone and Bloch 
functions. 


BLocH Functions. Let us consider the jth energy band. Figure 24.1 shows the dependence E; (p) 
for this band. We choose some value E? from this band and denote by p? the corresponding value 


Figure 24.1 of the momentum for motion to the right. The wave function of the chosen stationary 


Theory of Solids (2nd edition). Cambridge 


14 ones, H. (1960). The Theory of 
Brillouin Zones and Electronic States in 


state is denoted through 1) (p). It can be easily seen that this function differs from zero only for p = 


p? + 2nhn/a [it can be seen from the figure that only at these points does the curve E;(p) intersect the 
straight line E = E°]. Hence the function ty (p) may be written in the form 
2nhn 
ty (p) = 5 Q (p) ô [e Pr a -») (24.12) 
n 


Next, we go over to the coordinate representation by using the familiar rule (23.4): 


+00 
0 1 0 ; 
a) == f ) exp(ipx/ħ) d 
TN S S RA 
Substituting (24.12) into this, we get 


1 2zh 2ah 
gy (x) = F 2 Q (e + = =) exp | (e + ad 7 
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By introducing the notation 


(2nh)—1/2 > Q (e + | exp (==) = u9 (x) (24.13) 








i 
a a 
we can rewrite the above result in the following form: 
0 0 ip?x 
(x) =u; (x) exp (= 


The explicit form of the function ui (x) is not known [ in order to know it, we should have known the 
explicit form of U(x)]. However, it can be seen from (24.13) with the period of the that the function 
u5 (x) is periodic field: 

us (x+a)= uS (x) (24.14) 
Thus the wave function of a stationary state given by the indices j and p has the following form in 


coordinate representation [cf. (22.24)] 


Pip(x) = Ujp(x) exp (=>) (24.15) 


This is a plane wave [exp(ipx/h)] whose amplitude [ujp(x)] is periodic with the period of the field. 


The functions (24.15) are referred to as Bloch functions in the literature. 


Let us consider the motion of a particle is a field whose potential is The Kronig-Penney Potential 
shown in Figure 24.2 (the so-called Kronig-Penney potential). This is the 


simplest case of a periodic potential. 


Figure 24.2 shows three spatial regions. Assuming first Figure 24.2 that U Ze [2 | 
E > Uo we write the solution equation (20.13): aes = 
-6 ô a B46 


for region 1 
Figure 24.2: Periodic Kronig-Penney 


@1(x) = A; exp(ikyx) + Bı exp(—ik,x) potential. 
42m(E — U, 
k= m( 0) 
h 


for region 2 


@2(x) = Az exp(ikgx) + B2 exp(—ik2x) 


The solution for region 3 may be expressed in terms of the solution for 
region 1 by using results obtained previously. Let us take a certain point x 


of region 3. According to (24.15), we can write 


3(x) = u(x) exp (=) (24.16) 


The symmetric point x — l in region 1 corresponds to the chosen point. At 
the point x — l we have 
ip (x — 1) 


pi(x — 1) = u(x — I) exp (25>) 


LINEAR OPERATORS IN QUANTUM MECHANICS 


In accordance with (24.14), we rewrite this equality in the form 


g1(x —1) = u(x) exp (z=) (24.17) 


From (24.16) and (24.17) we find 


or, finally, 


s(x) = exp (=) {A1 exp [iki(x — 1)] + Bı exp [-iki(x-D]} (2418) 


By using (24.18) and the expression for ~; and @2, we can write the conti- 
nuity conditions for the wave function and its first derivative at the points 
corresponding to the potential jump (the points x = 0 and x = a). These 
conditions form a homogeneous system of linear equations in terms of the 
coefficients A4, B,, Az, Bo: 


A, + By = A + Bo 
A; exp (ikza) + By exp (—ikza) = exp (ipl/h)[A, exp (—ik,b) + B, exp (ik,b)], 
ky Ay — ky By = ky Ag — k2B2, 
k2[A> exp(ikza) — B2 exp(—ikza)] = exp(ipl/h) [Ai exp(—ikıb) — Bı exp(ikib)|ky 
By equating the determinant of this system to zero, we get the following 
system (we omit the intermediate steps): 
24 72 


pl ki + k; : s 
cos | — | = cos(kza) cos(k;b) sin(k2a) sin(k,b) (24.19) 
h 2ky ke 





Since the modulus of the cosine cannot be greater than unity, we get the 


following condition imposed on quantities kı and kz and, hence, on E: 


2 2 


ke t+ks\ | ; 
cos(k2a) cos(k,b) — | i 2) sin(kz2a) sin( hsb) <1 (24.20) 


-1< 
2k ko 








This condition defines the allowed energy bands. We next consider the 


case when E < Uo. 


Now k; is an imaginary quantity. Suppose kı = iks, where kz = 
/2m(Up - E) /h. Since by replacing k, by ik; the cosine and the sine 
are converted into hyperbolic cosine and hyperbolic sine, respectively 
[cos(k,;b) — cosh(k3b), sin(k,b) — i sinh(k3b)1, we can make use of the 


result (24.19), which in this case assumes the form 





pl k- ki\ l 
cos | — | = cos(kza) cosh(k3b) + sin(kza) sinh(k3b) (24.21) 
h 2koks 


Accordingly, the condition (24.20) defining the energy bands is trans- 
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formed into the following form: 





k? — k2 
—1 < |cos(kz2a) cosh(k3b) + | T: ; 2) sin(kza) sinh(k3b)} <1 (24.22) 
2k3 


Let us consider a special case, when 
b«a(axl); E<« U (24.23) 


(the barriers are narrow and high). Since in this case the quantity b can 
become arbitrarily small, we can require that the following conditions be 
fulfilled: 

by2mUy/h «1, or kb<« 1 (24.24) 


By taking (24.24) into account, we put cosh(k3b) = 1 and sinh(k3b) = k3b, 
and besides, according to (24.23), 


ky ky ks 1 Uo 
2kək; 2k, 2\E 


As a result, (24.21) assumes the following form: 





còs (=) = F(a) (24.25) 


where we have used the notation 





Upb 
F(y) = cosy + E ) sin y (24.26) 
The condition (24.22) assumes the form 
-1 < F(kza) < 1 (24.27) 


The function F(kza) is shown in Figure 24.3. The parts of the kza axis, 
for which the condition (24.27) is satisfied have been shaded in the di- 


agram. They correspond to the allowed energy bands (remember that 


ky = V2mE/h). 


F Figure 24.3: Energy bands due to periodic 


44m Udi potential. 


T V 


LA 
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Using (24.25)-(24.27), we shall make two remarks. Firstly, if a — o0 
(transition to a free electron), cos (k,a) — cos (pa/h); this corresponds to 
a transition to the classical relation (23.16) for the energy and momentum 
of a particle. Secondly, it can be seen from Figure 24.3 that a discontinu- 
ity in the energy of the electron occurs when cos (pa/h) = +1, i.e. for 
pa/h = nz,where n is an integer. According to (24.11), this means that the 


discontinuities in the energy of the electron occur at the boundary of the 


Brillouin zone. Figure 24.4 shows the dependence of the electron energy 





E Figure 24.4: Brillouin zone and discontinu- 
ity in energy. 
on momentum as determined by (24.25). The discontinuities in the energy 
mentioned above are clearly seen (the energy bands are shown on the left 
side by hatching). For comparison the classical dependence of energy on 
momentum is shown for a free electron: 
2 
e 
2m 
The conversion of energy levels of an electron in the atom into energy Formation of Energy Bands 
bands of an electron “collectivized” by the crystal may be seen as an effect as an Effect of the Removal of 
of the removal of commutation degeneracy. Commutation Degeneracy 


As was mentioned in Section 22, the electron energy in an atom in 
the general case of a non-Coulomb potential is determined by the quan- 
tum numbers n and l. This gives rise to a 2(2! + 1)-fold degeneracy of 
this energy. By assuming that the atom is a part of some perfectly or- 
dered ensemble of N similar atoms and that it remains isolated from 
its neighbours, the degeneracy in the electron energy must be taken as 
2N(2l + 1)-fold. The factor N is associated with the so-called commuta- 
tion degeneracy: in an ordered ensemble, there are no physically isolated 
atoms, hence the energy of an electron cannot depend on any of the N 
atoms near which it is localized. However, in a real ensemble (i.e. a crys- 


tal), the atoms are not isolated — they interact with one another. This in- 
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teraction leads to a “collectivization” of the electron and a partial removal 
of degeneracy of its levels. The level with a 2N (21 + 1)-fold degeneracy 
is split up into a system consisting of N(2/ + 1) sublevels, each of which 
remains doubly degenerate (according to the spin quantum number øo). 
Thus a “collectivization” of the electron by the crystal leads to a removal 
of commutation degeneracy and degeneracy with respect to the quantum 


number m. 


It is significant that a system of N(2l + 1) sublevels is in reality not 
discrete, but forms a band of allowed values of the electron energy. In- 
deed, let AE be the width of this system of sublevels, and Ae the distance 


between the neighbouring sublevels: 


AE 
Ae = — (2l + 1) 
N 
For the system of sublevels to be discrete, it is essential Ae > —, where 
T 
t is the lifetime of an electron in a crystal. In other words, the distance 
between sublevels must be greater than the uncertainty in the energy of 


the sublevel described by the relation (3.2). This means that the condition 


n(2l+1)N 


24.28 
hE (24.28) 


must be satisfied. Assuming that N(2/ +1) ~ 1x 10% , AE = 1eV, we 
find that r should be more than 1 x 10° s, i.e. more than 10 years. Since the 
real lifetime of a “collectivized” electron in the crystal may only be less 
than this, the relation (24.28) is obviously not satisfied. This enables us to 
consider the system of N(2l + 1) sublevels as an energy band. Naturally 
the number of electron states in the band remains finite-the band may 
“accommodate” up to 2N (2l + 1) electrons. In this connection we speak of 


the degree of filling in of a band, of completely filled bands, etc. 


25 The Probability of Quantum Transitions 


We assume that a microparticle undergoes a transition from one 
stationary state to another under the action of some external factor. How 
to find the probability of such a transition? The initial and the final states 
of the microparticle are described by functions of the type (20.16). For 


example, let the initial state be given by 





W(x, t) = Pn (x) exp (-") 


For simplicity, the set of spatial coordinates is denoted by x, The functions 


Y, satisfy the Schrödinger equation (20.17): 


9 ` 
it Yn = HY, (25.1) 


Quantum Transitions and the 
Principle of Superposition of 


States 
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(we shall call it the “unperturbed” equation). The physical nature of ex- 
ternal factor, which causes the quantum transition of the microparticle, is 
arbitrary. In particular, it may be the interaction of the microparticle with 
electromagnetic radiation. In the quantum theory apparatus such a factor 
appears in the form of an interaction potential which must be added to 
the “unperturbed” Hamiltonian H. In Section 22 in the example of the in- 
teraction of an electron with radiation, this “addition” to the Hamiltonian 
was interpreted as some perturbation and was denoted by H’. We shall 
use the same notation here. By taking into account the perturbation H’, 


we can rewrite the Schrédinger equation in the form 
ð fu 
ins On =(H+H’')®, (25.2) 


This equation is called the “perturbed” equation. Its solutions ®, are no 
longer stationary states. Hence the index n here does not fix the energy 
level, but merely indicates the past history: the given “perturbed” state 


has “emerged” from nth “unperturbed” state. 


Essentially, the state ®, is a superposition state: 
Bn(x,t) =)” xnk(t) Pe(xs €) (25.3) 
n 


If at a certain time t we “switch on” the corresponding detector, the super- 
position (25.3) is destroyed and the microparticle will be observed in one 
of the stationary states, for example, in the state Ym. This means that the 
microparticle has undergone a quantum transition from the state ¥,, to 


the state Ym. It is well known that the probability of such a transition is 
[Xam Ð)’. 


The reader is in fact familiar with all this. The remarks made above are 
in agreement with those made in Section 10, regarding the relation (10.3), 


which is essentially equivalent to the, relation (25.3). 


Thus, the transition probability is, as expected, the square of the modu- 


lus of the corresponding transition amplitude: 


Wnm = l¥nm(t) |? (25.4) 


This amplitude is one of the coefficients in the superposition (25.3), which 

is just an expansion of the “perturbed” state ©, in terms of “unperturbed” 

states. In order to find the probability wpm, we should first solve equa- Transition to the Energy Repre- 
tion (25.2) and then find the expansion coefficients of the solutions ob- sentation 

tained for states of the type (20.16). As can be easily seen, such an ap- 

proach corresponds to the first method in the “scheme” of transition from 

one representation to another, considered in Section 23. The other ap- 

proach corresponding to the second method in this scheme, however, is 


more rational. We shall follow the second method here. 
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According to (15.5), the coefficients ynk of the superposition (25.3) may 
be treated as a wave function describing the “perturbed” state, though 
not in the coordinate representation (in the coordinate representation 
this is done by the function ®,,), but in the representation of a set of those 
physical quantities with respect to which the functions % are eigenfunc- 
tions. Since the functions , are eigenfunctions of the Hamiltonian, we 
call this representation the energy representation. Thus, following the 
second method in the “scheme” in Section 23, we must “translate” the 
“perturbed” equation (25.2) given in the coordinate representation into 
the energy representation. This will give us a new equation — the “per- 
turbed” Schrödinger equation in the energy representation. The solutions 
of this equation will be the required transition amplitudes. Performing the 
operations indicated above, we substitute (25.3) into (25.2), and take into 


account (25.1). We get 





k d nk Ay 
p2 7 Y= 2, XnkH Yk (25.5) 


Premultiplying both sides of (25.5) by ‘¥;, and integrating with respect to 


the spatial coordinates, we obtain 


. d * * ry 
AD g [Mie Med = Y xu [YAO Wed 


Taking into account the orthonormalization of stationary states, we can 


rewrite the last expression in the following form: 

. d * Ty 

in Xnm = 2, Xnk | Vy HV dx (25.6) 
Next, we introduce the expression 


fee We dx = exp (iome!) | Omi pe ax 


= exp (iomkt) (m | H’ | k) 


(25.7) 


where 
Em = Ex 
h 


Taking into account (25.7) we may rewrite (25.6) in the following final 


Omk = (25.8) 


form: 
d 


qanm = a Dyin (m | H’ | k ) exp (i@mkt) (25.9) 


Thus, we have obtained the “perturbed” Schrédinger equation in the 
energy representation. In fact (25.9) is not one equation, but a system 
of equations. We emphasize that the system (25.9) is more convenient 
than (25.2), since by solving this system we can at once determine the 
transition amplitudes Ynm. It is also quite significant that when solving 


the system (25.9), the perturbation method can usually be applied. 
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The perturbation is usually very small, which enables us to obtain Application of the Method of 
an approximate solution of the system (25.9) by using the method of Perturbations to Computation 
perturbations. A small perturbation means that the function ®, may be of the Transition Probabilities 


represented in the form 
©, =¥,+An (25.10) 


where A, is a small addition to the unperturbed function ¥„. In accor- 
dance with (25.10), we can write 
Xnk = Onk + 47 +70) + ar (25.11) 


It can be easily seen that 
2 3 
A= oy 1 +O +O + weal] Fk 
k 


According to (25.11), the small addition A is in turn split into additions dif- 


(1) 


fering in the order of smallness: the functions y,¢ are of the same order 


as the perturbation, ee ) 


Substituting (25.11) into (25.9), we get 


d d d 
(1) (2) (3) 5 (1) (2) 4 
grmt Gprnm + 7 Xnm -> = nk + Xak +X T sarl 


x ( m | H’ | k) exp(iamet) 


are of the order of the square of perturbation, etc. 





(25.12) 


We rewrite (25.12), retaining terms of only first order in the perturbation: 


d o 


i s, f 
Pre O ( m | H | 1) exp(i@nmt) (25.13) 


This is the approximate expression for the amplitudes y,,,, obtained in 


the first-order approximation in the method of perturbations. 


If it turns out that ( m | H’ | n) = 0, we must use the approximate 
expression for the amplitudes in the second-order approximation in the 
method of perturbations. It is obtained from (25.12) by retaining terms of 


the second order in the perturbation: 


B= ( a ; 
ox h 2 (m | F | n) expCiomkt) (25.14) 
Similarly, for the third-order approximation in the method of perturbations 


we get 


Ti TO a 2 m | H’ | n) exp(iamet) (25.15) 


etc, 
Knowing the transition amplitudes for any order in the method of 


perturbations, we may obtain the transition probabilities in the corre- 


sponding approximation. In the first approximation, we get, from (25.13), 
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2 


t 
wi) = (m | A’(t) | n) exp(i@mnt) dt (25.16) 


2 
= [xin =a 








For the second-order approximation in the method of perturbations we 


get, from (25.14), 


2 
Woe z Df XL O) (m | Ê) | k) expliomkt) dt (25.17) 
etc. 


Note that the result (25.17) describes the interference of amplitudes. 
The resulting transition amplitude here is the sum of the transition ampli- 
tudes through various intermediate states. The impossibility of finding the 
microparticle in any intermediate state leads to the indistinguishability 
of alternatives and permits us to speak of intermediate states as virtual 


states. 


The further course of action envisages a substitution of definite opera- 
tors H’ in expressions of the type (25.16) and (25.17). In quantum electron- 
ics, for example, the operator (22.29) is used. A detailed consideration of 
such questions is beyond the scope of this book. The reader is advised to 


16 


refer, for example, to Fermi!” and Tarasov?®, in this connection. 


In conclusion, we return to the Feynman’s diagrams considered in 
Section 6 (see the diagrams in Figure 6.1, relating to the scattering of 
one electron by another). Note that all the diagrams shown in Figure 6.1 
are associated with the same quantum transition — the transition of two 
electrons from particular initial states to particular final states. Strictly 
speaking, each particular transition must be described by an infinite 
number of diagrams, with an ever increasing number of vertices. In this 


connection, we recall the remark expressed in Section 6: 


In order to calculate the probability of scattering of an electron by an 
electron, it is necessary, in principle, to take into account the contribution 


(to this transition) of various processes described by different diagrams. 


Returning to Feynman’s diagrams, we can now clarify the meaning of 
the remark quoted above. We can now explain the meaning of “taking 
into account” of the contribution from different diagrams. What happens 
is that to each diagram there corresponds a definite transition amplitude. 
The resulting amplitude is the sum of these amplitudes. Consequently, 
in order to compute the probability of a particular quantum transition, 
it is necessary firstly to form all the possible Feynman’s diagrams of 
the transition and write the amplitudes corresponding to the various 


diagrams, and, secondly, sum all these amplitudes and find the square of 


15 Fermi, E. (1961). Notes on quantum me- 
chanics. University of Chicago Press 


16Tarasov, L. V. (1976). Physical Founda- 
tions of Quantum Electronics (in Russian). 
Sov. Radio, Moscow 


Feynman’s Diagrams and 
the Calculation of Transition 


Probabilities 
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the modulus of the sum. 


In Section 6 we had the sentence: “Fortunately, the contribution of dif- 
ferent processes (different diagrams) is different? This means that in prac- 
tice the method of perturbations is used, which allows us to restrict the 


summation of amplitudes to just the first few terms. The dimensionless 
n 
ove 
— |“, mentioned in Section 6, is the factor which in the above- 
c 
mentioned sum of amplitudes contains the amplitudes corresponding to 


quantity (; 


the diagrams with n vertices. The smallness of this quantity explains the 


applicability of the method of perturbations in quantum electrodynamics. 


It may be said that the quantum-mechanical idea of interference of 
transition amplitudes along with the method of perturbations forms, from 
a most general point of view, the foundation of quantum electrodynamics 


as a quantum theory. 


26 Ways of Describing Evolution of Microsystems with Time 


To conclude the book, let us discuss in very general terms the very 
important problem of the methods of describing the evolution of micro- 
systems with time. The quantum-mechanical apparatus so far developed 
uses three different methods. Three different forms of the equations of 
motion in quantum mechanics correspond to these methods. We shall 
discuss these methods below. Usually we refer to them as three different 
representations: the Schrödinger representation, the Heisenberg represen- 
tation and the interaction representation (Dirac representation). The term 
“representation” is used here in a different, wider sense than in the previ- 
ous sections. We may also speak of Schrédinger’s coordinate representa- 
tion or Schrédinger’s momentum representation. In the previous sections 
we restricted ourselves to the Schrödinger representation but considered 
various representations, understood in a more restricted sense - the co- 
ordinate, the momentum and the energy representations. We shall now 
consider Schrédinger, Heisenberg and Dirac representations, restricting 


ourselves to the coordinate representation. 


Here, the evolution of a microsystem with time is described as an Schrodinger Representation 


evolution of the amplitudes of states of the given microsystem: 
Y (x, t) = U(t, to) Y (x, to) (26.1) 
The operator U(t, to) satisfies the conditions 


U(t,to) =1 (26.2) 
vut =utt =1 (26.3) 


The condition (26.2) is obvious. The condition (26.3) of unitarity is a direct 
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consequence of the independence of the normalization of amplitudes of 


state from the choice of the instant of time: 


ii V(x, t) W(x.t) dx = | W(x t0) We to) dx 


The quantum-mechanical equation of motion in the Schrédinger represen- 


tation (corresponding to Schrédinger’s equation) is of the form 
a i 
ihe W (x,t) = A(x) y (xt) (26.4) 


where the operator H does not depend on time. By using (26.4) and (26.2), 
we can easily find the form of the operator U(t, to): 


G(t, to) = exp -i (t= to) f) (26.5) 
The exponent should be treated here as an expansion into a power series. 


The evolution of a microsystem with time is described as an evolution 
of Hermitian operators describing a given microsystem. The amplitudes of 


states are treated as time-independent: 


¢ (x, t) = 9 (x, to) (26.6) 


The transition from amplitudes in the Schrédinger representation to 
amplitudes in the Heisenberg representation is carried out, as can be seen 


from (26.1), with the help of the operator U' (t, to): 


o (x, t) =U" (t, to) Y (t, to) (26.7) 
For, substituting (26.1) into (26.7), we get 


¢ (x, t) =U'U = Y (t, to) 
= U(t, to) 9 (x, to) = 9 (x, to) 


Let L(x) be the operator of some physical quantity in Schrédinger’s repre- 
sentation. According to (17.32) and (26.7), this operator in the Heisenberg 


representation will have the form 
L (x, t) = ÛŻ (t, to) L(x) U'(t, to) (26.8) 


Substituting (26.5) into (26.8), we find 


L (x, t) = exp Fe — to) n| L(x) 
(26.9) 
X exp 5 — to) f) 


Heisenberg Representation 
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Differentiating (26.9) with respect to time, we get 
ing L(x, t) = exp K — to) f) L(x) H(x) 
x exp E = w) AG)| — exp Fe 7 w) | (26.10) 
x H(x) L(x) exp -że =i) f| 
This may be written in the form 
ine L (x, t) = U'L(x) H(x)U - U'A(x) L(x) U (26.11) 
or, by taking into account (26.3), in the form 
ins L(x, t) = U'L(x) UU'A(x)U - OT A(x)UU'L(x) U (26.12) 


Using (26.8) with respect to the operator L(x) as well as the operator if 
H(x), we get the quantum-mechanical equation of motion in the Heisen- 


berg representation [cf. Eq. (19.10)]: 


ð a P : 
in L(x, t) = [Ê (x, t), É (x, t)] (26.13) 
We note that at the time to we have ọ (x, to) = Y (x, to) and L(x, to) = Comparison of Schrödinger 
L(x), ie. both the amplitudes of states and the operators coincide in and Heisenberg Representa- 
these two representations. However, at subsequent moments of time, we tions 


encounter two different situations. In the Schrödinger representation the 
amplitude of state changes, while the operator remains the same as it was 
at the moment to; in the Heisenberg representation, on the contrary, the 
operator changes, while the amplitude of state remains the same as it was 
at the moment fo. It could be said that in the Schrödinger representation 
the dependence on time is shifted to the amplitudes of states, and in the 


Heisenberg representation, to the operators. 


For practical calculations, it is usually more convenient to use the 
Schrédinger representation. However, the Heisenberg representation has 
the advantage that it permits us to draw a mathematical analogy between 
quantum mechanics and classical mechanics. In Heisenberg’s representa- 
tion the quantum-mechanical relations have the form of classical relations, 
in which the physical quantities are replaced by operators (in this connec- 


tion, we recall the last item of Section 20). 


Schrödinger, when he introduced in 1926 the representation now 
named after him, considered the time-dependent amplitudes of states 
as the amplitudes of certain waves, and thus gave birth to the subject of 
“wave mechanics”. A year earlier, Heisenberg proposed his method of 
describing the evolution of micro-systems with time, thus introducing for 


the first time equation (26.13) in matrix form. This gave rise to the sub- 
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ject of “matrix mechanics”. The difference between both these subjects is 
reduced to the above-mentioned difference between the Schrédinger and 
the Heisenberg representations, i.e. it is of a strictly formal, mathematical 


nature. 


Suppose that the Hamiltonian of a microsystem can be broken into 
two components, one of which (Ho) represents the Hamiltonian of the 
microsystem itself, and the other (H;) describes the interaction of the 
initial microsystem with external fields or other systems (in other words, 


is “responsible” for the effect of perturbation of the initial microsystem) : 
H(x, t) = Ho(x) + Hi (x, t) (26.14) 


In this case, it is convenient to use the interaction representation intro- 


duced by Dirac. 


The amplitude of state ¥(x, t) in the interaction representation is 
expressed through the amplitude of state y(x, t) in the Schrödinger repre- 


sentation in the following way: 


W(x, t) = exp jiwo -— n) y(x, t) (26.15) 


In accordance with (17.32) we find from here the form of the operator 


Lg(x, t) in the interaction representation: 
Îs(x, t) = exp Re z i) f| L(x) x exp Hz = in) | (26.16) 


We emphasize that in (26.15) and (26.16) we have used not the entire 
Hamiltonian, but just its “unperturbed” component Ho(x). Differentiating 


(26.15) with respect to time, we get 
gp 4 l i ; ə 
ih 5 t) = —Ho(x) ¥(x, t) + ihexp Fe - i) Fax) Prades t) 


Since 2 
ih 5 YO t) = [ Ho (x) + Ay (x, t)| y(x, t) 


the last result can be written in the following form: 
90 i à A 
ih 5 1 t) = exp K — iF) Hı (x, t) W(x, t) 


= exp K — to) fao) Hy exp Hz — to) fao) W(x, t) 


= ĤP (x, t) ¥(x, t) 
(26.17) 


Combining the results obtained above, we write the quantum-mechanical 


Interaction Representation 


(Dirac Representation) 
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equations of motion in the interaction representation as 


ih “ Lg (x, t) = [Lp (x, t), Ho (x, t)] (26.18) 
ih AT t) = P (x, t) ¥(x, t) (26.19) 


[note that (26.18) is obtained as a result of differentiation of (26.16) with 


respect to time]. 


Thus, in the interaction representation, the dependence of the am- 
plitude of state on time is determined by the interaction (perturbation) 
Hamiltonian H8, while the time dependence of the operator is deter- 
mined by the “unperturbed” Hamiltonian Hp. In this sense the interaction 
representation lies in between the Schrédinger and the Heisenberg repre- 


sentations. 


The vector analogy enables us to compare quite clearly all the three 
representations considered above. We correlate the system of basic states 
of the microparticle with the system of mutually orthogonal basic vectors 
in some arbitrary space. We shall consider all operators in matrix form 
defined by the system of basic vectors. The states of the microparticle are 
described by vectors considered in the coordinate system defined by the 
basic vectors. Thus we have a system of basic vectors and a set of vector 


states to be considered relative to this system. 


We now turn to various representations. In Schédinger’s represen- 
tation the evolution of a microparticle with time assumes a rotation of 
the vector state relative to the stationary system of basic vectors. In the 
Heisenberg representation the evolution of a microparticle with time as- 
sumes, on the contrary, a rotation of the system of basic vectors relative 
to the stationary vector state. Finally, in the interaction representation we 
assume a rotation of both the system of basic vectors and the vector state. 
Moreover, the rotation of the vector state is caused exclusively by the 
interaction of the microparticle with the external fields (perturbation) and 
is therefore dynamic in nature. The rotation of its system of basic vectors, 


however, is not related to the external factors, and is kinematic in nature. 


The methods of describing the evolution of microsystems with time 
considered above are based on the use of the quantum-mechanical equa- 
tions of motion. They assume a continuous evolution in time of either 
amplitudes of states or of certain Hermitian operators, or simultaneously 
of the amplitudes of states and the operators both. However, there are 
other qualitatively different processes. Thus, it is well known that a de- 
struction of the superposition of states caused by the detector in the act 
of measurement leads to an abrupt change in the amplitude of state. It is 
obvious that this change in the amplitude does not follow any equation of 


motion and obeys only probabilistic predictions. 


On the Vector Analogy Again 


One Additional Remark 
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In this connection two types of process are distinguished in quantum 
mechanics: processes associated with a continuous change in the ampli- 
tude of state in accordance with the equation of motion, and those con- 
nected with abrupt, unpredictable, unambiguous changes in the amplitude 
of state in the act of measurement. In the existing quantum-mechanical 
apparatus processes of the first kind are mainly involved. A considerable 
increase in recent times in the interest in quantum-mechanical problems 
of measurement indicates the starting point of serious research into pro- 
cesses of the second type (in this connection see the remarks about the 


HY, 17 Wigner, E. P. (1970). | Symmetries and 


Reflections. Indiana University Press, 
Bloomington- London 


problem of measurement in quantum mechanics by Wigner 


It is often believed that the probabilistic interpretation of quantum me- 
chanics is limited by the introduction of probability amplitudes, since the 
latter can be predicted unambiguously from a solution of the Schrödinger 
equation. Such a belief, as can be easily seen, is associated with a fact 
that the above-mentioned processes of the second type are not taken into 
account. Undoubtedly, these processes intensify and complicate the prob- 
ability aspect of quantum mechanics since they point to the necessity 
of considering a sort of “secondary probability” — the probability of the 
realization of the amplitude of probability. 


It must be admitted that the theory of quantum-mechanical measure- 
ment is still far from complete; processes of the second type have not 
been sufficiently incorporated into the apparatus of quantum mechanics. 
This means that modern quantum mechanics, in spite of its strictness and 
indisputable mathematical beauty, “conceals” unsolved problems, which 


predetermine its further development as a physical theory. 


On the History of Origin and Growth 
of Quantum Mechanics (A Brief 


Historical Survey 


Nineteenth century was an era of rapid growth in physics. It is enough to “The Crisis in Physics” 
mention just a few areas: the achievements in electricity and magnetism 
which led to Maxwells electromagnetic field theory and permitted the 
inclusion of optics into the framework of electromagnetic phenomena; 
the significant progress in the development of classical mechanics which 
came close to perfection as the result of a number of brilliant mathemat- 
ical works; the enunciation of many universal principles in physics, of 
prime importance among them being the law of conservation and trans- 
formation of energy. It is not astonishing that towards the end of the 

19" century it was generally believed that the description of the laws of 
nature was in a final stage. In this respect the famous remarks of Planck 
are worth noting. After defending his Ph.D. thesis, Planck wrote to his 
teacher and mentor Philip Jolly asking his advice as to whether he should 


seek a career in theoretical physics. 


“Young man”, replied Jolly, “Why do you want to ruin your life? The the- 
oretical physics is practically finished, the differential equations have all 
been solved. All that is left now is to consider individual special cases in- 
volving variations of initial boundary conditions. Is it worthwhile taking up 


a job which does not hold any prospects for the future?” 


In the August of 1900, the celebrated mathematician Hilbert presented 
his famous 23 problems before the Second International Congress of Math- 
ematicians. One of these problems (the sixth one) pertained to axiomation 
in physics. Hilbert proposed that a finite number of initial axioms should 
be formulated so that all the results needed for a complete description of 
the physical picture of the world could be obtained from them by purely 
logical means. The very fact that such a problem was stated speaks in 


most convincing manner of the belief held by scientists at the time that a 
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final description of physical phenomena was at hand. 


Events which followed dispelled such illusions very soon. At the turn 
of the 20"" century a number of fundamental discoveries which could not 
be contained within the framework of the existing theories in physics 
were reported. The list of these discoveries was quite imposing: X-rays, 
the dependence of the mass of an electron on its velocity, the incompre- 
hensible laws of the photoelectric effect, radioactivity, etc. It appeared 
that nature had decided to “laugh” at the self-confidence of people who 
thought they had uncovered all its secrets. 


Such an unexpected turn of events forced many physicists and philoso- 
phers to speak of a collapse of the earlier foundations, of the impossibility 
of knowing everything about matter, of the absence of objective laws of 
nature, of the “disappearance of mass”, etc. The earlier unanimity was 


replaced by sharp differences of opinion about fundamental ideas. 


In his book “Materialism and Empirio-criticism” published in 1908, V I. 
Lenin called this period in the development of physics a period of “crisis 


in physics”: 


The essence of the crisis in modern physics consists in the break-down of 
the old laws and basic principles, in the rejection of an objective reality 
existing outside the mind, that is, in the replacement of materialism by ide- 
alism and agnosticism. ‘Matter has disappeared’ - one may thus express 
the fundamental and characteristic difficulty in relation to many particular 


questions which has created this crisis. 


Analysing the causes that led to this crisis, Lenin wrote: It is mainly 
because the physicists did not know dialectics that the new physics strayed 


into idealism. 
Defending the dialectical point of view, Lenin emphasized that 


‘Matter disappears’ means that the limit within which we have hitherto 


known matter disappears and that our knowledge is penetrating deeper. 


Lenin pointed out that the period of crisis will culminate in a new leap 
in the development of physics and that its further development will occur 


on the lines of materialistic dialectics. He wrote: 


Modern physics is in travail; it is giving birth to dialectical materialism. 


The process of child-birth. is painful. 


Looking back, we can now say that this “travail” led also to the birth of 
quantum mechanics. As Lenin envisaged, the overcoming of this “crisis 
in physics” resulted in a deepening of our knowledge of matter, and de- 


manded a decisive turn from metaphysical to dialectical ideas. This was 


reflected most clearly in the quantum-mechanical concepts. We have ev- 
ery reason to associate quantum mechanics with a qualitative leap in our 


knowledge about nature (see Section 16). 


Considering the history of the initial period of quantum mechanics, we 


can isolate three distinct stages. 


§ The first stage: end of 19th century-1912 (first experiments and first 


attempts to explain them). 
§ The second stage: 1913- 1922 (Bohr’s quantum theory). 


§ The third stage: 1923- 1927 (establishment of quantum mechanics). 


We shall now proceed to study these three stages in detail. 


The foundations of quantum mechanics were laid by experiments 
conducted at the end of the 19th century and the beginning of the 20th 
century in various branches of physics which at that time were not con- 
nected with one another, e.g. atomic spectroscopy, study of black body 
radiation and the photoelectric effect, solid state physics, study of the 
structure of atom. By the end of the 19th century a lot of experimental 
material on the radiation spectra of atoms was accumulated. It turned 
out that atomic spectra are ordered sets of discrete lines (series). In 1885, 
Balmer discovered a series of lines of atomic hydrogen, later named after 
him, that could be described by a simple formula. In 1889, Rydberg found 
a series of lines for thallium and mercury. Extensive studies of the spectra 
of different atoms were conducted during this period by Kaiser and Runge 
who used photographic methods. In 1904, Lyman discovered a series of 
hydrogen lines falling in the ultraviolet region of the spectrum and in 
1909, Paschen found a hydrogen series in the infrared region of the spec- 
trum. Remarkably, the Lyman and Paschen series could be described by 
a formula which was very close to the one established earlier by Balmer. 
Noticing the regularities among various series of an atom, Ritz in 1908 for- 
mulated his famous combination principle (see Section 2). However, right 
until 1913, this formula could not be explained, the nature of the spectral 


lines remained unclear. 


Wien in 1896, studying black body radiation, derived a formula which 
accounted for the experimental results obtained for high radiation fre- 
quencies quite accurately (Wien’s law). This formula, however, was 
not applicable for lower frequencies. In 1900, Rayleigh proposed a for- 
mula which agreed fairly well with experiments for low frequencies (the 
Rayleigh-Jeans law) but led to absurd results for higher frequencies (this 
situation was known as ‘ultraviolet catastrophe’). In the same year Lum- 
mer and Pringsheim conducted detailed experimental studies over a wide 


frequency range. In order to explain the results obtained by Lummer and 
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First Experiments and First 
Attempts to Explain Them 
(end of 19th century-1912) 
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Pringsheim, Planck proposed his famous empirical formula which trans- 
formed into Wien’s and Rayleigh’s formulae in the corresponding limiting 


Cases. 


The formula proposed by Planck contained a certain constant, which 
he called an elementary quantum of action (i.e., the Planck’s constant h). 


According to Planck?®, 18 Planck, M. (1955). A Survey of Physics: 
A collection of Lectures and Essays by Max 
Planck, chapter The origin and develop- 
ment of the quantum theory. E. P. Dutton 
the deductions from the radiation theory were largely illusory and were and Co., NY 


Either the quantum of action was a fictitious quantity, in which case all 


nothing more than mathematical juggling. Or the radiation theory is 
founded on actual physical ideas, and then the quantum of action must 
play a fundamental role in physics, and proclaim itself as something quite 
new and hitherto unheard of, forcing us to recast our physical ideas, which, 
since the foundation of the infinitesimal calculus by Leibniz and Newton, 


were built on the assumption of continuity of all causal relations. 


After a careful consideration of his formula, Planck arrived at a bril- 
liant conclusion: it must be assumed that every radiating atom in a solid 
emits energy only discretely, in quanta, the energy of an individual quan- 
tum being equal to fiw. This resulted in the historic paper by Planck 
Theory of the Law of Distribution of Energy in a Normal Spectrum, which 
he presented before the Berlin Academy of Sciences on 14th December, 
1900. This day may, in fact, be considered as the birthday of quantum 


mechanics. 


Planck’s discovery was in sharp contradiction with the classical theory. 
It must be admitted that this was at first disturbing for Planck himself. 
In an attempt to reconcile his discovery with classical concepts, Planck 
proposed in 1911 a hybrid conception. According to this conception, 
only the emission process in radiation is discrete while the processes of 


propagation and absorption of radiation are continuous ones. 


Planck’s hybrid hypothesis got no recognition. Earlier, in 1905, Ein- 
stein gave a brilliant explanation of all the laws of the photoelectric effect 
known at that time by assuming that light is emitted as well as absorbed 
discretely. Later, in 1917, Einstein came to the conclusion that a light 
quantum possesses not only a definite energy but also a definite momen- 


tum equal to fiw/c. 


In 1907, Einstein successfully applied the idea of quantization to solve 
an important problem in solid-state physics which had baffled scientists 
for many years. Physicists in the 19th century had faced a violation of the 
classical law by Dulong and Petit. It was observed that the specific heat 
of solids is not constant but decreases upon a considerable lowering of 
temperature. As an example, let us consider the experiments conducted 


by Weber in 1875. He discovered a temperature dependence of the specific 
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heat of boron, carbon and silicon. The fact that specific heats of solids de- 
pend on temperature could not be explained within the framework of clas- 
sical theory until the appearance in 1907 of Einstein’s paper Correlating 
Planck’s Radiation Theory and the Specific Heat Theory. Applying Planck’s 
idea of quantization of energy to atomic vibrations in crystals, Einstein 
deduced a formula which, in agreement with experiment, described the 
temperature dependence of the specific heat of solids. Einstein’s work Iaid 


the foundation of the modern theory of the specific heat of solids. 


Finally, we must mention studies of the atomic structure whose origin 
can be traced to the year 1904 when Thomson proposed a model for the 
hydrogen atom in the form of a uniformly positively charged sphere 
with one electron at the centre. Later, Thomson came to the conclusion 
that the number of electrons in an atom must be proportional to the 
atomic weight and that the stability of the atom is impossible without the 
motion of the electrons. In 1908 Geiger and Marsden began studies on 
the scattering of a-particles passing through thin foils of different metals. 
They discovered that most of the a-particles pass through the foil without 
being scattered while some a-particles, roughly one in ten thousand, are 
strongly deflected (by an angle greater than 90°). In 1911, Rutherford came 
to the conclusion that the strong deflection of an a-particle occurs not as 
a result of many collisions, but in a single act of collision with an atom 
and, consequently, there must be a small positively charged nucleus at 
the centre of the atom, containing almost the entire mass of the atom. 
This was a decisive step towards the creation of the planetary model of the 


atom which Rutherford finally formulated in 1913. 


Thus, from the end of the 19th century to the year 1913 a large num- 
ber of experimental facts, which could not be explained on the basis of 
existing theory, were accumulated: the discovery of ordered series in 
atomic spectra, the discovery of the quantization of energy in black body 
radiation, the photoelectric effect, and the specific heat of solids; also 
the planetary model of the atom was created. However, until 1913, all 
these discoveries were considered separately. It was Bohr’s genius that 
understood the common character of these facts and created a fairly har- 


monious quantum theory on the atom based on these facts. 
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Bohr’s famous paper On the Constitution of Atoms and Molecules ap- Bohr’s Quantum Theory (1913- 


peared in 1913. It considered the theory of the planetary model of the 1922) 
hydrogen atom based on the idea of quantization (the energy and the an- 

gular momentum of an electron in an atom were quantized). Resolutely 

departing from accepted concepts, Bohr’s theory ruled out a direct link 

between the frequency of the radiation emitted by an atom and the fre- 

quency of the rotation of the electron in the atom. Having acquainted 


himself with Bohr’s theory, Einstein remarked: 
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Then the frequency of light does not depend at all on the frequency of the 


electron ... This is an enormous achievement! 


In fact, Bohr’s frequency rule provided a convincing explanation for Ritz’ 
combination principle and permitted calculations of the Rydberg Constant. 


Later, in 1945, referring to Bohr’s theory, Einstein wrote: 


This insecure and contradictory foundation sufficient to enable a man of 
Bohr’s unique instinct and tact to discover the major laws of spectral lines 
and of the electron-shells of the atoms together with their significance for 
chemistry appeared to me like a miracle and appears to me as a miracle 


even today. This is the highest form of musicality in the sphere of thought. 


Experiments conducted in 1914 gave direct experimental evidence for 
the fact that an atom may change its energy only discretely. These were 
the famous experiments by Franck and Hertz on the measurement of the 


electron energy spent on exciting mercury atoms. 


In 1915-1916, Sommerfeld developed Bohr’s theory. In particular, he 
generalized the method of quantization for the case of systems having 
more than one degree of freedom, by changing from circular orbits to 
elliptical ones, and studied the precession of an elliptical orbit in its own 
plane. In 1916 Debye and Sommerfeld came to the conclusion that the 
angular momentum components in the direction of a magnetic field are 
quantized, thus introducing the concept of quantization in space. This 
received excellent confirmation later in the experiments of Stern and 
Gerlach (1921) on the splitting of atomic beams in nonuniform magnetic 


fields. 


Continuing his researches in the field of quantum theory of atoms, 
Bohr in 1918 introduced the famous correspondence principle (in his arti- 
cle On the Quantum Theory of Line Spectra) which he had been using in 
fact since 1913. According to this principle, the laws of quantum physics 
must turn into the laws of classical physics for large values of quantum 
numbers of a system, i.e. if the relative values of the quantum of action 
are negligibly small. It follows from this that classical physics is of great 
importance in the discovery of laws of quantum mechanics, The period 
from 1913 to the early twenties can be considered as the period of the 
creation and development of Bohr’s quantum theory. The achievements 
of this theory are beyond a shadow of doubt. It served as the most impor- 
tant step towards the creation of quantum mechanics. However, in the 
course of the development of Bohr’s theory, its internal contradictions 
associated to a considerable extent with contradictions in the very idea 
of quantization and of “quantum jumps” (see Section 2) became more and 
more apparent. The theory clearly betrayed a stage of crisis. A further de- 


velopment of quantum mechanics demanded an overcoming of this crisis 
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and the introduction of new ideas. As was mentioned in Section 2, those 
contradictions were overcome by introducing the idea of wave-particle 
duality. By eliminating the contradictions of the “old quantum theory” the 
idea of duality marked the beginning of a new stage in the establishment 
of quantum mechanics as a genuine theory in physics, culminating in the 
creation of its apparatus, and, consequently, in the solution of a number of 


problems of atomic and nuclear physics. 


In 1923, Compton discovered the effect, later named after him, of a de- The Growth of Quantum Me- 
crease in the wavelength of X-rays upon scattering by matter. This effect chanics (1923-1927). 
clearly indicated the existence of wave as well as corpuscular properties 
of radiation. Light quanta were introduced into physics as elementary 


particles once and for all under the name of photons. 


In 1923-1924, de Broglie suggested in his doctoral thesis that the idea 
of wave-particle duality should be extended to all microparticles, associ- 
ating both wave and corpuscular characteristics with every particle (see 
Section 2). Later (in 1927), the idea of duality received a convincing confir- 
mation by experiments on electron diffraction conducted simultaneously 
in several different laboratories. In 1925 de Broglie introduced the concept 


of “matter waves” described by the so-called wave function. 


The combination of the idea of quantization with the idea of wave- 
particle duality proved to be very fruitful for the development of quantum 
mechanics, The whole apparatus of quantum mechanics was, in fact, built 
in 1925-1926. Heisenberg in 1925 took the first step in this direction. He 
suggested that every quantized dynamic variable should be represented 
in the form of some matrix whose diagonal elements are essentially the 
experimentally observed values of the variable (the reader is familiar with 
this approach for the case of the Hamiltonian, or energy, matrix which 
has been discussed in detail in the book). Using the correspondence prin- 
ciple, Heisenberg converted the matrix relations into classical relations 
for corresponding variables. In doing so, however, Heisenberg took into 
account the possibility of the commutativity of the product of the ma- 
trices involved. In 1926, Schrödinger in his paper On Quantization as an 
Eigenvalue Problem used the wave concepts to introduce his well-known 
differential equation for a wave function (this equation is now termed 
the Schrödinger equation). The problem of calculating the energy levels 
of a bound microparticle was reduced by Schrédinger to the problem of 


finding eigenvalues. 


After the appearance of Schrédinger’s work, it was at first believed that 
we now had two independent theories, Schrédinger’s wave mechanics and 
Heisenberg’s matrix mechanics. However, already in 1926, Schrödinger 
showed that both these theories are, in fact, equivalent and are just two 


different ways of looking at the same problem. It should be mentioned 
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that the wave formalism of Schrédinger’s theory was very well received, 
since it enabled a solution of quantum-mechanical problem with the help 
of established methods of mathematical physics. Planck’s opinion’? of 
Schrédinger equation is worth noting. According to him, the fundamental 
importance of this differential equation lies not only in the way it has 
been derived, but also in its physical interpretation, whose details are 

still not clear. But most important is the fact that owing to the introduc- 
tion of the quantum law into the well-known system of usual differential 
equations, we get an entirely new method which, with the help of mathe- 
matics, can solve the complicated quantum-mechanical problem. This is 
the first case when a quantum of action, which thus far was impervious to 
all attempts to look at it from the point of view of classical physics, can be 


included in a differential equation. 


While the formalism of Schrédingers theory was readily accepted, the 
problem of the interpretation of wave mechanics and the physical de- 
scription of the concept of the “wave function” remained the subject of 
heated discussion for a long time. In 1926, Born proposed a probability 
interpretation of the wave function: “matter waves” were replaced by 
“probability waves” The impossibility of interpreting the wave function 
as the amplitude of a certain material field (like the electromagnetic or 
the gravitational fields) was recognized even at that time. Planck in 1928, 
commenting on the nature of the wave function, wrote”? that the impos- 
sibility of a physically intuitive representation of this quantity, which 
only has an indirect symbolic meaning, is a direct Consequence of the fact 
that the wave motion takes place not in the ordinary three-dimensional 
space, but in the so called configurational space, where the dimensionality 
is determined by the number of degrees of freedom of the system being 


considered. 


This meant that de Broglie waves could not be interpreted as classical 


waves of some sort. 


The next important step towards the development of quantum me- 
chanics, which revealed its physical and philosophical aspects, was made 
in 1927 by Heisenberg who introduced his famous uncertainty relation. 
Through these relations Heisenberg showed how the concepts of energy, 
momentum, coordinate, etc. should be applied to the case of microparti- 
cles (see Section 3). The appearance of the uncertainty relations marked 
a final break of quantum mechanics from classical determinism, thus es- 
tablishing quantum mechanics as a statistical theory. Starting from the 
uncertainty relations, Bohr formulated in the same year one of the lead- 
ing discoveries of 20th century, i.e. the complementarity principle (see 


Section 16). 


The twenties was undoubtedly a period of most intensive development 
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of quantum mechanics. It is impossible to describe here all the important 


researches conducted during this period. We shall simply mention a few 


points below to supplement the above picture. 


1924 Pauli proposed that an electron could be assigned an ad- 
ditional (fourth) degree of freedom which can have two 


values. Uhlenbeck and Goudsmit, using Pauli’s idea, intro- 


duced in 1925 the concept of a “spinning electron” (in other 


words, the spin concept). 


1924 Bose carried out fundamental studies, which were extended 


by Einstein in the form of a statistical theory for photons 
which came to be known as Bose-Einstein statistics. In the 
framework of this theory, Planck’s formula for black-body 


radiation at last found a complete explanation. 


1925 Pauli formulated his famous exclusion principle for electrons. 

1925 Born and Jordan formulated Heisenberg’s theory in matrix 
form. 

1925 Dirac developed the relativistic theory of electron and 


hydrogen-like atoms. 


1926 Fermi and Dirac carried out fundamental studies on the 
statistical theory of electrons, later called the Fermi-Dirac 


statistics. 


Naturally, the history of the development of quantum mechanics did 
not terminate in 1927. In the following years, quantum mechanics was 
enriched by many new methods, applications, and most importantly by 
further studies of its physical and philosophical aspects. Some problems 
of quantum mechanics (above all, the problem of measurement) are still 
being investigated. However, we shall end our brief historical discussion 
with the year, 1927, assuming that further developments in quantum 


mechanics form the subject of a special study. 


Readers who want to go further into details concerning the origin 


and development of quantum mechanics are advised to refer to these 


sources.?! 
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Appendices 


A Eigenvalues and Eigenfunctions of a Hermitian Operator 


Theorem. An operator has real values if and only if it is Hermitian. Eigen- 


values and Eigenfunctions of a Hermitian Operator 


Proof. From (17.13) we get 


[rv otime-a fv open 
frosty waza fyo eax 
It follows hence that 


frota- [ybyd a- fV Oas 


Thus 
[ve (t-2) yoa-0-4 f yayad 


It can be seen from here that the equality A — A* (denoting the real positive 
eigenvalues) is satisfied if and only if the operator L is Hermitian (i.e. if 
and only if Ê — L’). 














Theorem. The eigenfunctions of a Hermitian operator which correspond to 


different eigenvalues are mutually orthogonal. 


Proof. Using (17.13), we write 
hte Anya Ct = av 
7 
J Vaip de= An f Yin © Yn dx 
J hO de= An f Yn Oa Oda 


It follows from here that 


J vnb- Ë) Yn die = (nAn) f Vh ind 


If the operator L is Hermitian, then L = Li (and also (4%, = Am). In this 


case the last equality acquires the form 


On- Am) | Wa in dx =0 


Since An + Am, we get 
J Yatndx =o 














B Transition from Quantum Mechanics to Classical Mechan- 
ics 


There is a formal analogy between the transition from quantum me- 
chanics to classical mechanics and the transition from wave optics to 
geometrical optics (i.e. the optico-mechanical analogy); see, for example, 
Landau and Lifshitz”, and Blokhintsev”’. A transition to geometrical op- 
tics means that the optical field is described by nearly plane waves of the 
type 

exp (i ®) = exp (iF - o) (B.1) 


It can be seen from (B.1) that 


oP 


k=Vọ @=-— 
ot 


(B.2) 
The vector k is directed along the light beam. At every point in space the 
light beam is perpendicular to the surface of constant phase (i.e. perpen- 


dicular to the surface ® = const). 


When going over to classical mechanics, the quantum-mechanical 
wave function must acquire a form analogous to (B.1); moreover, the 
phase of the wave function will be proportional to the mechanical action 
S: 

®=AS. 


The momentum p and energy E of a particle are expressed through the 


action S in the following way: 


> os 


p=VS, E=-> (B.3) 


The trajectories are lines perpendicular to the surface of constant action 
(in the same way as the light rays are perpendicular to the surface of 
constant phase). Comparing (B.2) and (B.3) and taking into account that 
p = hk, we find that 


p=- 
h 


Thus the quasi-classical wave function is of the form 
iS 
Y =a exp (5) (B.4) 
By using (B.4) it is easy to determine the coefficient y in the expression for 
the momentum operator 
ie A 
Let us consider the equation for the eigenfunctions of the momentum 


operator (the equation py = py) in the quasiclassical case. In this case we 
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m iS 
get p = VS, w =a exp (=) Thus the above equation assumes the form 


iS iS 
yV exp (| = VS exp (5) 
or 
iS i iS 
y exp (5) vsz = VS - exp (5) 
It can be easily seen that iy/h = 1, and, consequently, y = —iħ. The quasi- 


classical case may he used to substantiate the Schrödinger equation. It can 


be easily seen that the well-known classical Hamilton-Jacobi equation 


os 1 
— +—(VS)?+U =0 
ot 2m 


is a limiting case of the Schrödinger equation (20.17). By putting ¥ = 
iS 
exp (3) we get in this case 





ð os iS 
ih—¥ =- exp (! 
ot 


ot h 
h h? 
-— AF = -— V (VY) 
2m 2m 


h fi iS 
h iS 1 iS 
=-—VS - — | + — (VS)? — 
vs oF] + sate (5 
Substituting these results into (20.17) and neglecting terms containing 


Planck’s constant in this expression for AY, we get the equation 


as So Niven qui is 
= ex = ex emt ex = 
at Pln) 2m P\ P\ a 


By cancelling out the factor exp(iS/7), this equation becomes the Hamilton- 





Jacobi equation. 


C Commutation Relations 


We shall show how the commutation relations (20.28)-(20.30) can be 
derived. To start with we remark that the equality M = (7 x p) may be 


3 3 
Ni = X, 9, eikn fe Pn (C.1) 


where e;kn is a unit antisymmetric tensor of the 3rd rank (e123 = €231 = 


written in the form 





€312 = 1, €132 = €321 = €213 = —1, all the ether components of the tensor are 


equal to zero). 


Let us consider the commutator [Mi, ñj]. By using (C.1), we can write 
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this commutator in the following form: 
[Mi fj] = Mið; - ôf; Mi 
=Y, OD) errn (f Pa fy — y fi Pa) 
k n 
Taking into account that ^; fk = rk fj, we get 


[Mi, fj] =% 2e Cikn Vk [Pn ñj] 


According to (20.26), [Bn F;| = —iħônj. This leads to the required relation 
(20.28): 


[Mi, ñj] =iħ 2,4 Cikj fk 
= E Cijk Tk 
k 


This relation means that 


= [Mo, | = [Ms, 3 | =0 


Next let us consider the commutator [Mi, Bil. By using (C.1), we can 


write it in the form 
[M;, Pj = 21 D4 enn ( Tk Pn Pj — Pi Yk Pn) 


Taking into account that pn Pj = Pj Pn, we get 
) | = >, » Cikn [r Pi] Pn 
k n 


According to (20.26), [o Bil = ihd,;. As a result, we get the required 
relation (20.29): 
[M;, pj] =ih 2 eijn Pn 


Thus 
[M, PA = ih ps 
[Mo, Ps| = ihp, 
[Ms, Pil = ih p 
[Mi Pi] = [M p2] = [Ms, ps] = 0 


Going over to the commutator [M;, M | , we shall restrict ourselves for 
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simplicity to the case wheni = 1, j = 2. Using 


Mı = É ps — Ê p2 
My = 73 pi - p3 
we can write 
[Mi Mz] = (È Bs - Ê Po) (B Pi — f Bs) — (73 Bi — A Bs) X (72 Bs — h po) 
= fo ps ÊB Pi — fo ps Ô P3 — h Po. Ps Pit fs Pot ps — Ê Pile ps 
+ 6 pi B Po + Ô p3 h ps — Â p33 p2 
Note that Po p3 ri p3 = Fi p3 Po p3 and P3 p2 P3 pi = Ê pi Ê P2 By taking this 
into account, we write the expression for [M., Mp): 
[Mi, Mz] = (72 ps B Pi — Ê Bi P Bs) + (f Pot Ps — À Ps Ps po) 
= FP (P 6 — Ê p3) +71 Po (73 Ps — Ps 7s) 
Since Ê p3 — p Ê = ih, we finally obtain 
[Mi, Mo] = ih (A p2- Ê pi) 
= ih Mg 
Proceeding in the same way, it can be easily seen that 
|M, Ms] = ih M, 
|M, Mı] = ih M 


All these results can be combined in the relation (20.30). 


D Commutation of Operators M?, M; 


Let us consider the case when i = 1: 
[M?, M] = (My + M + Mg) My — (My My + M; + M3) 
= (My + M3) My — Mi (Mz + M3) 
= MMM, + M;M;Mı — M, M2 M — M, M3 MB 
= (MM, M; — ih MM3) + (M3M, Mg + ih M3 Mo) 
— (MMM + ih M3M2) — (M Mı M; — ih M2 Mb) 
=0 


Similarly, we get [M?, M2] = 0 and [M?, M3] = 0. This proves the relation 
(20.31). 
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E Some Special Functions 


Legendre Polynomials: The Legendre polynomials P; (x) may be defined 


as the coefficients in the expansion of the function 


1 
H (x, t) = ———, |t |< 1 
V1i+ 0? — 2tx 
as a power series in t: 
1 = à 
—— = J Pi (x)t (E.1) 


V1+t?-2tx To 


(-1 < x < 1, lis a non-negative integer). 
The polynomials P; (x) are the solutions of the differential equation 
(1-x*)y” — 2xy’+1(1+1)y=0, (1=0,1,2,...) (E.2) 


and satisfy the condition y (1) = 1. They can he written in the form 


ie [e = 1)'| (E.3) 


Pi (x) = ora aa 


We note that 
Pi(1)=1, Py (=x) = (-1)'P; (x) 
Po(x) =1 
Pi(x) =x 
P(x) = 53x =1) 
P3(x) = 5 (5x — 3x) 


The Legendre polynomials are orthonormalized: 


1 


J P; (x) Py (x) dx = 


-1 


2 
21+1 





Ow (E.4) 


The basic recurrence relations are 


(L+ 1) Pry, +1 Py — x (21+ 1) P) =0 


P! 


P;— P 1-1 


g — 
l+1 


+2xP;=0 
Pia Py = (21+ 1)P; 
Associated Legendre Functions: The associated Legendre functions 
pL, (x) are defined through the Legendre polynomials P;(x) in the follow- 


ing way: 





m d 
py" (x) =(1- x")? 


smi (x) (E.5) 
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(m and l are positive integers, m < 1), 


pæ) = P(x); py =3X5Xx7...(21- 1) (1- x’)! 
pi(x) =(1-x”)? 
p3(x) = 3x(1 — x2)! 


plx) = 30- x°)... 


The functions p}, (x) satisfy the differential equation 








(1—x")y” — 2xy' + | I(l +1) — | y=0 (E.6) 
They are orthonormalized: 
1 
[orc woop ey dx= o E 
-1 


The basic recurrence relations are 


(21+1) xp (x) = (1-m+1)pr (x) + (1+ mp (x) 
Pint (x) — pet? (x) = (21+ 1) V1- x?pr (x) 


Harmonic Functions. The harmonic function Yjn(0, @) is defined as the 


Im] 


product of the Legendre associated function p, 


exp (imọ): 


l l- ! 
Yin (6, @) = Ey (cos 6) exp (img) (Œ) 


(Lis a non-negative integer, m = 0, +1, +2,... + l, 0 and ọ are angular 


(cos 0) and the function 


coordinates, 0 < 0 < 2,0 < ọ < 2r). Expressions for the first few 





harmonic functions are given in Section 21 (see (21.26a) - (21.26c)). 
The harmonic functions satisfy the differential equation 
Aggy t+l(l+ 1)y =0 (E.9) 


where 





A 1 ð P 1 of. 6 

= sin 
O° ~ sin? @ ag? sinb a0 30 
The spherical functions are orthonormalized: 


2m 1 


J i Yin (9, P) Yrm (0, o) sin @ dO dọ = dy mw (E.10) 
0 0 


Hermite Polynomials. The Hermite polynomials H,,(x) may be defined 
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as expansion coefficients of the function exp (2xt — t°): 


exp (2xt — t?) = DS Hn wE (E.11) 


n=0 


(—œ < x < œ, nis a non-negative integer). 


Hermite polynomials are described by the expression: 


n 





AE E eee 


Tun P (=x°) (E.12) 


Some of the values are 


Hy (~x) = (-1)" Hn (x) 
Ho (x) =1 
Hy (x) = 2x 
Hy (x) = 4x? - 2 
H; (x) = 8x? — 12x 


The polynomials H, (x) satisfy the differential equation 
y” — 2xy’+2ny=0 (n=0,1,2,...) (E.13) 


They are orthonormalized: 
7 Hy (x) Hw (x) exp (-x°) dx = Vr 2" n! nw (E.14) 


The basic recurrence relations are 


2 x Hy — 2n An-1 = Ant 
2n Hp- = Hy, 


Modified Hermite Polynomials. (Hermite-Gauss Polynomials). The 
Hermite-Gauss polynomials h,,(x) are defined in terms of the Hermite 


polynomials H,,(x) by the formula 


1 x? 
hy (x) = ae (x) exp (-5} (E.15) 


The condition of orthonormalization for them has the form 


0 


J hn © hw (x) dx = nw (E.16) 


—oo 


These polynomials satisfy the differential equation 


y” +(2n+1-x*)y =0 (E.17) 
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F Linear Harmonic Oscillator 


By taking (22.1) into account, we can write Schrédinger’s equation 


2m mo" 





0" (x) + Eg (x) - x? g(x) =0 E) 
We introduce the notation 
mo 2E 
aN E aa 
E= T" ho 


In the new notation, (F.1) acquires the form 


yg" (A) +A- Eeo (F.2) 


The differential equation for Hermite-Gauss polynomials is of the same 


form (see (E.17)). Moreover, À = 2n + 1 and, consequently 
1 
E=ho(ns z) n = 0,1,2, 


The wave function ọn (&) is apart from a constant factor a Hermite-Gauss 
polynomial: 


Pn (È) = Chn (8) 


The factor C is determined from the normalization condition for the wave 


function @p (x): 


ove) 


[ieoa=t 


Thus 


0 


1= [oho ax 


—oo 


oe) 


dx 
- f ho Fa 
-JZ foo 
i co 
-Ze f eOk 


Taking (E.16) into account we conclude that 


h v mo 
—C =1, d, tly, C= /— 
F and, consequently, 7 
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Thus we arrive at the relation (22.3) 


cope E et oe 
“fon no 
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Quantum transitions, 32, 224 
anomalies of, 35 


computation of probability, 224 


Rayleigh-Jeans law, 237 
Reduction of a wave packet, 40 
Representation, 167 
coordinate, 14, 141 
coorindate, 184 
Dirac’s, 14 
Heisenberg, 14, 229-231 


interaction, 229, 232 
momentum, 141 
Schrodinger, 14, 229, 230 
transition from coordinate to momen- 
tum, 14 
Resonances, 27 
Rest mass, 25 


Rydberg’s constant, 32, 34 


Scattering 
of microparticles, 79 
of neutrons by a crystal, 94 
coherent, 95 
incoherent, 95 
Schrödinger equation 
in momentum representation, 214 
time dependent, 187 
Schrédinger’s equation, 186, 201 
application of, 194 
in energy representation, 225 
time dependent 
hydrodynamic analogy, 201 
Spatial parity, 45, 190 
and angular momentum, 192 
Special functions, 207, 250 
Spectral terms, 32 
Spectrum 
discrete, 33 
line, 32 
Spherical functions, 191, 199, 207 
Spin, 25 
electron, precession of, 133 
Spontaneous emission probability, 92 
Stark’s effect, 209 
States of a microparticle 
basic, 102, 144, 172 
condition of orthonormalization of, 
102 
mutually independent, 100 
mutually orthogonal, 100, 147 
stationary, 33, 124 
transition from one system to an- 
other, 128, 174 
Statistical ensemble, 120 
Superposition of states, 96, 106 
destruction in the process of measure- 
ments, 109 
passage of photons through polariz- 
ers, 102 
quantum transitions, 100, 224 
role of analyzer, 108 


total and partial distinguishability, 
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101 
transition from one representation to 
another, 225 


Transition amplitudes, rule of addition, 
82 
Transition probability, 80, 226, 228 
amplitude, 81, 85 
product, 82 
sum, 82 
Transmission coefficient of a barrier, 
205 
Tunneling effect, 52 


Uncertainty relations, 13, 40, 242 
diffraction of microparticles, 42 
evaluation of absorption edge, 49 
evaluation of ground-state energy of 
hydrogen atom, 47 

evaluation of zero-point oscillations, 
48 

trajectory of a micreparticle, 51 

Unitary invariance, 183 
of commutation relations, 214 

Unitary invariants, 170 


Universal dynamic variables, 30 


Vector 
of probability flow density, 203 
wave, 39 

Vector analogy, 174, 233 

Vector, of state, 136 

Virtual microparticle, 61 


Virtual transitions, 46, 61 


Wave function, 137, 139, 241, 242 
of a freely moving microparticle, 140 
of an electron in a crystal (Bloch 

function), 209 

Wave packet, 40, 55 

Wave vector, 39 

Wave, pilot, 55 

Wave-particle duality, 36, 40, 56 

Wien’s law, 237 


Young’s experiment, 72 


Zeeman’s effect, 209 


Zero-point oscillations, 48 
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